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Circumferential Gap in a Circular Wave Guide Excited by a Dominant 
Circular-Electric Wave 


LEONARD S. SHEINGOLD* AND JAMES E. STORER 
Cruft Laboratory, Harvard University, Cambridge, Massachusetts 


(Received July 23, 1953) 


The scattering matrix of a 360-degree circumferential slot in an infinitely long circular wave guide with 
an incident dominant circular-electric wave is obtained by a variational principle. The theory which should 
by very good for small gaps is shown to be in good agreement with the obtained experimental results. 





1, INTRODUCTION 


ONSIDER a circumferential gap in an infinitely 
long, perfectly conducting circular wave guide as 
shown in Fig. 1. The gap can be considered to be a 
region of free-space coupling two semi-infinite circular 
wave guides. Assuming that the gap width is small 
compared to the guide wavelength, the basic problem is 
to determine the effect of the gap on an incident TE, 
wave. 

The energy in the TE, wave incident on the gap is 
partially reflected and partially transmitted. Some of 
the energy is coupled through the gap to the free space 
surrounding the wave guide. Since the power relation- 
ships are the essential quantities of interest, the problem 
is formulated in terms of the scattering matrix of a 
wave-guide junction. 

Intuitively, the small gap should have little effect on 
the propagation of an incident circular-electric wave. 
The circular-electric modes have the unique property 
that the wall surface currents are purely transverse. 
These transverse currents do not tend to charge the 
metallic boundary of the circumferential gap; hence 
little energy is expected to radiate into space. On the 
other hand, all 7M modes are characterized by solely 
longitudinal currents on the guide wall, and all TE 
modes other than the circular-electric type have longi- 
tudinal components of current. It can be concluded 


* Now with the Boston Engineering Laboratory of the Sylvania 
Electric Products, Inc., Boston, Massachusetts. The research 
reported in this document was made possible through support 
extended Cruft Laboratory, Harvard University, jointly by the 
Navy Department (U. S. Office of Naval Research), the Signal 
Corps of the U. S. Army, and the U. S. Air Force. 


that the modal energy of all extraneous modes will be 
coupled strongly to the surrounding medium. 

The problem is formulated in terms of an integral 
equation for the axial component of the magnetic field 
on the gap surface. A variational principle is applied 
and a parameter A is introduced representing the sta- 
tionary parameter in the variational formulation of the 
integral equation. For simplification, even and odd ex- 
citations are used and the elements of the scattering 
matrix are expressed in terms of the appropriate A’s. 
The restriction imposed by the theory is that the gap 
width must be small compared with wavelength. 


2. FORMULATION OF THE PROBLEM 


Since the following analysis pertains to circularly 
symmetrical wave-guide modes, the electromagnetic 
field components can be derived from a scalar function 
which is independent of the ¢ coordinate. Assuming a 
time dependence exp—iwt, the circular-electric modes 
are obtained by solving the equation 


LE, (r,z) = 0, (1) 
where the operator L acting on the ¢ components of the 
field is given by 

a? 10 & 


1 
= —f— —}. 4. 2 (2) 
or rdér a2 Pr 


! __} 


Fic. 1. A circumferential gap in an infinitely long 
circular wave guide. 
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Fic. 2. Orientation of cylindrical coordinate system. 


The other field components can be expressed in 
terms of Ey by 


io 
H,(r,2 =— —E,(r,z), 
wy OZ 
—i1d (3) 
H,(r,2) a —rE,(r,2), 
wu r Or 
E,=E,=H,=0. 


The walls of the cylinder are assumed to be perfectly 
conducting; therefore, the boundary conditions at the 
wall require that the tangential components of the 
electric field vanish. Hence, the boundary condition is 


E¢(r,2) Jrma=0. (4) 


A circumferential gap, of width 2g, centered at z=0 
is introduced in the infinite guide as indicated in Fig. 2. 
The region for r<a will be designated by superscript I 
and the region exterior to the guide, r>a, by super- 
script IT. 

In region II (r>a) the field can be expressed in 
terms of a Green’s function and the field distribution on 
the surface of the gap. The Green’s function is defined 
by the equation 


5(r—r’)5(z—2’) 
LG" (r,r’, z—2'/)=— r>a 


r’ (5) 





GU (r,r', z—2’) |rea=0. 


By making use of Green’s second theorem, an in- 
tegral representation of the external field can be found 
in the form 


71 0 
EsM(ra)= [ - —r'G (r,r’, -2)| 
gtr’ Or’ 


E(2')dz’, (6) 


r’=a 
where 


£(2’) _ LE, (7,2") nme: 


While an explicit form for this Green’s function can 
be obtained if desired, it is not necessary. Actually, it is 
simpler to determine the function occurring in the 
integral of (6). It is apparent from (6) that the integrand 
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must satisfy the two conditions 


wl a ~ 
L} — —7’G" (r,r’, z—2’) =0 r>a (7) 


is” Or’ r’=a 


i 
— —7'G" (r,7', z—2') 


— =3(c—-2') (8) 
r Or 








Jr’=a, r=a 


as well as the Sommerfeld radiation condition. By the 
use of the Fourier transform one obtains an expression 
of the form! 


1 a 
- —7'G" (r,7’, 1] 


r Or’ 


1p” A, (2—2)tr] 
=—] ditto ds, (9) 
2rd. A, (#—¥)ta] 





where 
(Re—g*)t=i(e?—k*)! for ¢>k. 


It is immediately apparent that such a choice satisfies 
(7) and (8) as well as the radiation conditions. Since 
the Green’s function is unique, (9) is, therefore, the 
desired function. 

A similar analysis can also be carried out for the 
region I (r>a). Since there are fields propagating inside 
the pipe, the resulting expressions for the fields include 
an additional term representing the incident wave. In 
this case one obtains 


E;! (r,z) = E;} Inc. (r,z) 


71 0 
-{ —- —7'G' (r,r’, -2)] E(2’)dz’. (10) 


, , 
—gt? Or 


Performing an analysis similar to that for region II, 
it is possible to show that 


1 a 
-|- —r'G' (r,r’, -2)] 
r’ Or’ r’=a 

_ 2\4 
_t pr §?) rd 


dnc JiL—-)a] 





gf. (11) 


Assuming only a single mode propagates in the pipe, 
the Bessel function in the denominator of (11) vanishes 
at two places (+71) on the real axis, and at an infinite 
set of points (+iy,) along the imaginary axis. The 
contour C used in (11) is that indicated in Fig. 3. The 
contour is chosen to pass around the singularities in the 
manner shown so that the Green’s function represents 
only waves propagating away from the junction. 

Thus, the entire electromagnetic field has been ex- 
pressed in terms of the field on the surface of the gap. 
It remains now to determine integral equations for 


1C. Papas, Radiation from a Transverse Slot in an Infinite 
Cylinder, Cruft Laboratory Tech. Report, No. 61, 1950. 
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these fields. This can be done by matching the magnetic 
fields across the gap. 

In particular, from (9) the magnetic field in the gap 
surface can be shown (except for dimensional con- 
stants) to be 


10 
H}} (os - —rE3 2) 


ror 


g 
~j K(s—2')k(2!)dz’, (12) 
where _ 


1 i) 
K"(s—s))=— f eit Ade (13) 
; 2rJ_., 
and a wE(k r } 
2 (2 a 
H(¢)= ; (k2— ¢?)!. (14) 
H.C (#— 94a] 





Similarly, from (10) 


1a 10a 
H;} (as)—|- —rE;}! eo) = - —rE¢ tne. 2] 
ror = r or cis 

g 
+f K!(z—2')&(z’)d2’, (15) 

where , ™ 
Ke-s)=— fet s@odr (16) 

2x Cc 


and 


IAM) a] 
Iil(P—-)ta] 
H." (a,2) = H.'(a,z) 





J(§) (k?— 9)! (17) 


Now 
—g<s<g, 





iS. 
4-185 


9-i84 





Fic. 3. Contour C in complex plane. 





@-ib, 
9-id, 


@-i54 





Fic. 4. Contour C in complex plane. 


and, therefore, one obtains the integral equation 


10 g 
-—rE;} (ra) | -{ [K"(z—2’)—K!(s—2')] 


r or 
Xé(2’)ds’ —g<s<g. (18) 
3. VARIATIONAL FORMULATION 


It is now convenient to separate K'(z—z’) into two 
terms, as follows: 


i. 1 
Ki(s—s)=RUe—2)+— f eta, (19) 
2rlc 
where 


. 1p 
Ri(s—s/)=— J eit (2) J (dk, (20) 
2rd _.. 


i.e., integrals along the real axis and similar in form to 
those of (13). The remaining integrals along the con- 
tour C indicated in Fig. 4 can be evaluated by means 
of residues. 

The integral equation then assumes the form 





iy,a° 10 ad 
- —rE; we.) + f cosy1(z—2')&(2’)dz’ 
pr r or r=a -o 
= f K(z—2')é(2’)d2’, (21) 
where 2 


11a - 
K (2—2')=—_[K"™ (z—2')—K'(2—2')] = (22) 
ip? 
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and, the constants pi, y: are defined by 


pi\*}3 
Ji (p1) =0; n=[e-(*) |. 
a 


The fields incident in the wave guide can be ex- 
pressed in the form 
r 
Jif pi- 
Pi a 
4 E,! Inc. (r, 2)= a(z). (23) 
iyia*  Jo(p1) 





Their z dependence, a(z), is, in general, of the form 
a(z) = Ae'n?+ Ben, 


i.e., fields incident on the gap from either the right or 
left. It is convenient to specialize this into even or odd 
incident fields, i.e., 


a,(z)=cosy;z even 
ao(z)=siny;z odd. 


For the even case, &,(z)=£,(—z) and hence, 


, 


f cosy: (z—2')é-(2’)« ails) f ae(2')E.(2') dz’. 


Similarly, for the case of odd excitation £(z)= —£ 
< (—:z) and hence, 


J cosrste— 2’ )Eq(2’)dz’ masts) fi ao (2 )Eo(2’)dz’. 


Therefore, for either even or odd excitation, (23) can 
be inserted in the integral equation (21) which then 
becomes: 


a(a)|1+ f a(ee(e)de' |= f K(z—32’)é(2’)dz’. (24) 


Multiplying (24) by &(z), integrating from —g to g 


and dividing by . 
| f a(a)e(ds| 


yields the variational principle 


1+ f a(s)etos [fixe £(z)E(2’)dzdz’ 


‘its Pisin ithe. 
ff acsecoas | f ‘a(e)eebae] 


Equation (25) can be shown to have all the usual varia- 
tional properties, i.e., A is stationary with respect to 
small changes of ¢(z) about its correct value and A is 
independent of the over-all amplitude of &(z). 





(25) 
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Thus, it is possible to insert an approximate value 
for &(z) in (25) and obtain relatively more accurate 
values of A. As will be seen later, the various A’s can 
be related to the scattering matrix of the junction. 

Referring to Eqs. (13), (14), (17), (20), and (24), it 
is seen that K(z—z’) can be expressed in the form 








K(e—s)= fi etry nnay (26) 
where, using the Wronskian for Bessel functions, 
oo (27) 
v(¢)= ~ (27 
3p;" Jil (P-—?)ta Al (P—@)4ta] 


The variational principles can then be written in the 
form 


- f B (py (dr, (28) 


f costzé,(z)dz 


.()=— . 
f cosy 1z¢,(z)dz 


—@ 


f sinfzéo(z)dz 


f siny12£0(z)dz 


=—@ 


where 





(29) 


Po (F)= 





It remains now to choose trial functions for £(z) and 
insert them into (28). 
Consider the following choices 


E.(z) = (g?—2")8£o(z) = 2(g?@—2*)!. (30) 


It is apparent that these functions satisfy the sym- 
metry conditions and have the correct static depend- 
ence on the edge of the gap. For small gaps they should, 
therefore, be good approximations to the correct fields. 
Inserting these into (29) yields 


wit g) 

tJi(nig) 
(2/68 Wig)— Lol) /¢8)1 
~ [2Fx(vag)/ rag)*]—LJo(rsg)/ rig) 


Inserting (31) into (28), it can be shown (see Ap- 
pendix) that 


amt) 





’.(f)= 











y me 132 
102.2 
yidp;"\ 16 mw \g 
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It is to be noted that these results are only valid for 
small gaps, i.e., g?/A*<1. Results for larger gaps would 
be difficult to obtain theoretically because of the lack 
of satisfactory trial functions for £(z). Moreover, it is 
virtually impossible to make satisfactory experimental 
measurements for large gaps since the interaction of the 
gap with the “free space” region outside the guide is of 
prime importance and a “‘free space”’ region is difficult 
to duplicate experimentally. 


4. TRANSMISSION-LINE REPRESENTATION 


It is now desirable to express the scattering matrix 
elements in terms of A, and Ao. This can be accomplished 
by examining the fields inside the wave guide at a ter- 
minal plane located a great distance from the gap dis- 
continuity. The evaluation of the scattering (or im- 
pedance) matrix requires the knowledge of the explicit 
asymptotic expression of the Green’s function of Eq. 
(11). This expression is obtained in the Appendix. Sub- 
stitution of the asymptotic form into Eq. (10) yields 


E,! (r,z) = E.' Inc. (r,2) 


r 
(0) 
iP a 


ya" J o(p1) 





cinw f e'M?’E(z’)ds’ (33) 


v 


for the electric field in the far zone. 
It then follows that the z dependence is of the form 


g 
E4'(r,2)~a(2)--e-n# f ein’t(a")de’. (34) 


=G 


Now for the even case (a(z)=cosyz) this expression 
becomes 


a 
Batra) | 142 f cosy §(¢')d! fei (35) 


=—@ 
and the even reflection coefficient I, can be defined as 
g 
r= 1+2f cosy12'&(2’)dz’. (36) 
= 


Similarly, for odd excitation a(z)=siny1z, the odd re- 
flection coefficient I’) can be expressed as 


9g 
[yo=—1-—2 J siny2’&(2’)d2’. (37) 


—¢g 


The integrals of Eqs. (36) and (37) can be expressed 
in terms of A from Eq. (25), resulting in 


A.—1 
A+1 

Ao—1 
Aot1 





r= 
(38) 
r'o= = 
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The elements of the scattering matrix are then given 
by 


r.t+To 1 1 
i ta 
r.—To 1 





Siu=S2= 


(39) 








i— ~ ‘ 
2 i+Ay 1+A, 


5. EXPERIMENTAL AND THEORETICAL RESULTS 


Measurements of the characteristics of the circum- 
ferential gap were performed in a 5.755-in. i.d. brass 
tubing at a free-space wavelength of 10.000 cm. 
Deschamps’ method was used to determine the ele- 
ments of the scattering matrix of the circumferential 
gap. In comparing the theoretical and experimental 
results it was convenient to examine separately the 
absolute values and arguments of the scattering matrix 
elements. This method of presentation of data was 
chosen because the experimental values of amplitude 
and phase were determined separately by Deschamps’ 
method.” 

The absolute values of Si, S22, and Sj. are then 
given by 








, , , | A-— Ao| 
Su} = | S22| =- 
| 1+-Ao| |1+A./ 
(40) 
12| = ? 
| 1+ Ao| | 1+A.| 


and their arguments are found to be 

ArgSi11= ArgS22= Arg (A4.— Ao) 
—Arg(1+Ao)—Arg(i+A.) (41) 

ArgSi2= Arg(A,Ao— 1) — Arg(1+Ao)—Arg(1+-A,). 


The experimental values are plotted on the same 
figure as the theoretically determined curves in Figs. 5 
and 6.f It is observed that the experimental value of 
|Si:1| and |Sj2| are in excellent agreement with the 
theory for values of 2g up to 1.000 in. corresponding to 
g/A=0.127. It is recalled that the theory is only 
claimed to be valid for values of (g/A)*1. As seen from 
the phase characteristics of S,,, the experimental values 
lie slightly above the theoretical curve for small values 
of 2g. It is observed in Fig. 6 that there is again ex- 
cellent agreement between experiment and theory for 
values of ArgSi2 such that 2g < 1.000. 


2 Storer, Sheingold, and Stein, Proc. Inst. Radio Engrs. 41, 
1004-1013 (1953). 

+ A detailed description of the experimental techniques as well 
as a more complete discussion of the experimental results is given 
in Cruft Laboratory Technical Report No. 167. It should also be 
noted that in calculating the theoretical values of the scattering 
matrix elements from Eq. (32) —j is substituted for +4 to con- 
form to the convention established in measurement. 
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Fic. 5. Measured and calculated values of S$). 


APPENDIX A. EVALUATION OF THE VARIATIONAL 
PARAMETERS A, AND A> 


The determination of the elements of the scattering 
matrix of a circumferential gap requires the evaluation 
of 


an f &2(¢W (Ode 


(Al) 
hei f HEC (Ode. 


—20 


Since the assumed electric field distribution in the 
gap is a good approximation only for small values of 
vig, the parameters ®,(¢) and &o(¢) can be expressed 
in the approximate form 


_ Wis) | 2J1(&g) 
ci (rig) fg 
2/ (¢g)* Vilég)—LJo(tg)/t¢ 





®.(5) Vg<1 (A2) 








$(f)= 
[2J1(y1g)/ (vig)? ]—LJo(mig)/rng ] 
8 2 Jo(§g) 
=—|—_J(tg)-——}_ -11gK1 
vig | (fg)? fg 
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PHASE OF S,, 
270 
260}-— —_ 
250}+-— — 
Arg S, 
DEGREES 2g 
on > 
oi X = 10.000 CM 
Ag = 18.141 CM 
—— THEORETICAL CURVE 
220|— x EXPERIMENTAL POINTS —+ 
210 -— — 
oot__t__| | | | | 
O 0250 0,500 0.750 1000 1250 1500 1750 
2g INCHES 
and #(¢) is given by 
710" 





¥(o)= , 
mer iL (—&)'a JA (—) 4a] 


It is convenient to determine separately the real and 
imaginary parts of A, and Ao. Proceeding to evaluate 
R.P.A, one observes that the integrand is real only 
for §<k. Therefore, assuming kg<1 as well as yig<1, 
so that Ji(¢g)=fg/2 for 0<¢<k, it follows that 


2y1a* f 
rp? 0 JPL (R 


2y 107 f dt 
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For #1, the denominator is large, and hence there is 
little contribution. On the other hand for ¢ near zero, 
and with ka equal to 4.592, the Hankel function is 
already well approximated by its asymptotic form 
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Fic. 6. Measured and calculated values of Sj. 


Inserting (A5) into (A4) results in 








mika ft (i-f)! 
R.P.A.= f -dt, (A6) 
Tpr 0 1+ (6 1i—F) 
where the quantity 6 is defined as 
3 
8 (ka)? 


Equation (A6) can be written as 


7 k*a® 46 l (1—f)! 
|1-— —— at. (A7) 
Jy 5+(1—#) 
Now since 5X1, it can be neglected in evaluating the 
remaining integral of (A7). One then obtains 
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To evaluate the imaginary part of A,, examine the 
integral 


10° 
I.P.A,=LP. 





wp, 
f L(2/Sg)Ji(¢g) F 
2 JiL(P—$)'a ML (RP —$*) 4a] 


Noting that the integrand is imaginary for values of 
¢>k, and that the major contribution to the integral 
occurs for {>>k, one can write 





dg. (A9) 


ya 0 2 2 
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with |¢a|>>1 
has been used. It then follows that{ 
8yia fa\? c* (ile) P 
L.P.A.= (*) ———-——df 
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mp \g 
In a similar fashion the real part of Ao is given by 


R.P.Ap=R.P. (—) 
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t Watson, A Treatise on the Theory of Bessel Functions (Cam- 
bridge University Press, Cambridge, 1945), p. 397. 
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It is seen again that the integrand has a real part when 
k<¢. Again assuming kg<1, and using the leading term 
in the expansion of the numerator, one obtains 


2a? k ¢ 
R.P.Ao= f dg. (A14) 
rpryn P |H,9C (R—¢)'a]|? 








Again making the substitution ‘=¢/k, 


2(ka)® dt 
R.P.Ao= f . (A115) 
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Introducing the asymptotic expression |H,[(1—f)! 
Xka]|*~2/rka(i—Ff)! this becomes 
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Finally, by using an argument similar to that which 
led from (A9) to (A10), the imaginary part of Ap is 
seen to be of the form 
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This integral can be expressed as 
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STORER 


Finally, the resultant expressions for the variational 
parameters are given by 


yia({(ka)?> 4/a\? 
rol 
prt 4 TNg 


1 ;(ka)* 32/a\4 
a LIeeee()} 
yap’ 16 wNX\g 


APPENDIX B. EVALUATION OF THE EXPLICIT 
ASYMPTOTIC EXPRESSION OF THE GREEN’S 
FUNCTION OF EQ. (11) 
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It is desired to derive an explicit form for the Green’s 
function of Eq. (11) in order to evaluate the far-zone 
electric field in region I. The expression for the Green’s 
function is given by 
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where the contour C is chosen as in Fig. 3. 

It is noted that the integrand is analytic except for 
simple poles at f=+y1 and {[=+7y,(m>1). Since the 
result is required for zz’ it is appropriate to close the 
contour with a large semicircle in the lower half-plane. 
It is apparent that the integral along the semicircle 
vanishes as its radius tends to infinity. Therefore, by 
the theory of residues, and noting that only the poles 
t=—y, and {=—iy,(m>1) are enclosed, one obtains 
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Since e*S(*-") represents a rapidly attenuated func- 
tion when zz’ and ¢ is negative imaginary, all higher- 
order terms are negligible compared to the dominant- 
mode term. 

Evaluating the residue corresponding to {= —71, one 
obtains the desired expression 
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over a Circular Grounded Screen* 
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The problem of a vertical monopole situated over a circular perfectly conducting screen lying on a finitely 
conducting ground is considered. An approximate methed employed criginally by Abbott to calculate the 
self-impedance is discussed. Using this formula explicit expressions are derived fer the self-impedance cf a 
thin, top-loaded vertical monopole. Sinusoidal current distribution is assumed. 





INTRODUCTION 


SUALLY the most important factor contributing 

to the inefficiency of low-frequency antennas is 
to be found in the ground system. The rules for ground 
system design are usually empirical and based on re- 
sults of experiments on existing installations. Recently, 
the general problem has been investigated analytically 
by Abbott! who has developed a design procedure to 
select the optimum number of radial conductors speci- 
fied given the values of the electrical constants of the 
ground. 

An important related problem is the actual change of 
input impedance of the antenna for different sizes of 
ground systems. This analysis has been carried out for 
a vertical antenna situated centrally over a perfectly 
conducting disk by Leitner and Spence? and more 
recently by Storer.* They, however, only considered the 
case when the surrounding medium was free space. A 
more appropriate situation is when the disk is lying on 
the surface of a homogeneous conducting half-space 
which corresponds to the ground. It is not surprising 
that the solution of this problem is in general very 
difficult. 


FORMULATION OF PROBLEM 


With reference to a cylindrical polar coordinate 
system (,¢,z) the antenna of height / is coincident with 
the positive z axis. The circular screen of radius a lies 
in the plane z= 0 which is also the surface of the ground. 
The conductivity and dielectric constant of the ground 
are denoted by a and e, respectively, and the dielectric 
constant of the air by €o. The permeability of the whole 
space is « which is taken to be the same as free space. 

Due to the obvious polar symmetry in this problem, 
Maxwell’s equations take the following form for time 
dependence according to exp(iw?): 


n O 
pE,=— —(pH¢) (1) 
Y Op 
* Paper presented at Joint International Scientific Radio Union- 
Institute of Radio Engineers meeting in Ottawa, October 6th, 1953. 
1F. R. Abbott, Proc. Inst. Radio Engrs. 40, 846 (1952). 
2 A. Leitner and R. D. Spence, J. Appl. Phys. 21, 1001 (1950). 
J. E. Storer, J. Appl. Phys. 22, 1058 (1951). 





n OH, 
E,=--—— (2) 
yY oz 
OE, OE, ; 
——=twH , (3) 
Op az 


where and 7 are the intrinsic propagation constant 
and characteristic impedance of the medium, respec- 
tively, and defined by 


= Linw(o+iwe) ]! 
and 


n= Lipw/(o+iwe) ]}. 


A subscript zero is affixed to these quantities in order 
that they should pertain to the air. That is, 


Te™ i(€ou) ly= i2n/d, 
where ) is the free space wavelength in meters and 
no = (u/€o) 377 ohms. 


Following the method of Storer,’ the difference be- 
tween the impedance Z of the antenna over a finite 
screen and the impedance Zp for an infinite screen is 
denoted by AZ and is given by 


No 710 
AZ=Z-2Z)=—- f (- ~pllst )I (eh. (4a) 
volo Yo p Op 





In this equation H,* is the change of the magnetic field 
H,; for a finite screen from the magnetic field H,” for an 
infinite screen. The current on the antenna is /(z) and 
the base current is Jo. It then follows that 


1 a] 
AZ=-— ry f H 3” (p,0)E,(p,0)2xpdp, (4b) 
o Yo 


where b exo ' 

1 @ exp| — Yo(2’+ 9") 
ESoh=-—— | ————I@e (e) 
2m dp Yo (2?-+p?)! 


THE APPROXIMATE SOLUTION 


The magnetic field in the ground outside the screen 
is a solution of the wave equation 


(A—~*)H4(p,2) =0, (5) 
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and therefore 
He(o,2) = f Tidrpe*pOAMA, z<0, (6) 
0 
where u= (A?+-”)! and the electric field is given by 
n oo 
E,=-- f uJ (Ape“*p(A)AdA, 2<0. (7) 
0 


Y 


The binomial expansion of u is of the form 
2 
war(14+—+ wes ), 
27" 


n oc 
E,= -*| f yJi(Ap)e“*p(A)AdA 
Yo 


so that 


+ f (1/29) JQroderspayntant |. (8) 


Now the differential equation for J is given by 


d10 
(— ~ <p-¥ )Js09)=0 
Op p Ap 


and so it readily follows that, for z=0 


2y* Op p Ap 
+terms containing y~*. (9) 


The second term is negligible if the propagation constant 
in the ground is sufficiently large and if pH, is not 
varying too rapidly with p. That is pH, should not 
change appreciably in a distance equal to |y~"| in the 
radial direction for points outside the screen. This 
approximation is always well justified under practical 
conditions at low radio-frequencies. 

Employing this result for Z, the equation for the 
self-impedance is now given by 


7 rs 
az=— f H4*(p,0)H(0,0)2medp. (10) 
0 p=a 


To apply this formula to an actual situation it can be 
assumed that H4(,0) is not very different from H,”(p,0) 
in the region of the ground plane where the losses are 
significant, and therefore 


azn f He Fexede. (AH 


An equivalent statement of this approximation is that 
the electrical radial current density at the surface of 
the ground is not appreciably affected by the finite 
conductivity of the soil. The approximate expression 


WAIT AND W. J. 





SURTEES 


for AZ in Eq. (11) was first given in this form by 
Abbott‘ who quoted it without proof but pointed out 
that it was a plausible result. Later Monteath® also 
arrived at this same formula from an extension of the 
compensation theorem well known in circuit theory. 
A better approximation to Eq. (10) might be to set 


where F(p) is Sommerfeld’s “surface wave attenuation 
factor” which is a function of the numerical distance p 
which for a good conducting ground (o>ew) is given by 


p= (rp/d) (€xw/c). 


When p<1 it can be shown or seen from tabulated 
numerical values® that F(p) is very close to unity. 
If p>1 the magnitude of F(p) drops appreciably below 
unity. However, since. H,* varies essentially as ¢~7/ 
and since F(p) is a slowly varying function the contri- 
bution from the integrand in Eq. (11) for the cases 
where p>1 are negligible. At low radio-frequencies 
(<1000 kc) and for moderate ground conductivities 
(c>10-* mhos per meter) the approximate formula 
given by Eq. (11) is sufficiently close to the formula in 
Eq. (10) to justify its use. 

Up to this point no restriction has been placed on the 
current distribution on the antenna. For thin antennas 
it can usually be assumed that 


I(z)=Im sin(a— Bz) (13) 
for 
0>2>h, 


where B=~0/i=2xr/A. The maximum current ampli- 
tude is J,,, and the conditions at the upper end of the 
antenna specify a. For unterminated ends J(#)=0 so 
that a= 8h and then 


I(z)=Io sin8(h—z)/sinBh. 


In some instances at low frequencies it is customary to 
load the antenna near the top so that effectively 
I(h)+0. It is convenient then to set 


a=B(h+h'), 
where h’ specifies the degree of top loading. 
Employing the value of J(z) given by Eq. (13) and 


inserting it into Eq. (4c) the integration with respect 
to hk can be carried out to give 


o e~¥e 








ifr 
cos (Sh—a)— 
p p 


th e~ ‘br 


4 m 
HH,” (9,0) = — | cosa 
2r 





sin(@h—) (14) 


pr 


4F. R. Abbott, Standard Radial Ground Systems for 1.f. and 
m.f. Monopole Transmitting Antennas. U. S. Navy Electronics 
Laboratory (San Diego), Report No. 219, Oct. 1950. 

“Gs D. Monteath, Proc. Inst. Elect. Engrs. (London) 98, Pt. IV, 
23 (1951). 

*K. A. Norton, Proc. Inst. Radio Engrs. 24, 1367 (1936). 
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Fic. 1. The incremental self-impedance AZ for a quarter-wave 
monopole over a circular grounded screen. 


where r= (p?+-?)!. Using this value for H,” the inte- 
gration indicated in Eq. (11) can now be carried out 
to yield 


n 
AZ = ————{ cos? (Bh—aa) 5+ cos’al 
2x sin’a 


— h? sin? (Bh—a)I,+ 2th cosa sin(Bh—a)I 3 
— 2 cos(Bh—a) cosal 4— 2th sin(Bh—a) 
Xcos(Bh—a)Io}, (15) 


where J;, J2, J3---ZJ¢ are integrals which can be ex- 
pressed in terms of the exponential integral 


i) og = 
Ei(—um)= -f dx 


x 





as follows: 


x e72iBr 1 
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pua 17 2h i 
— Ei — 218 (ro— h) Je?) 


© y-iB(ptr) 1 
lie f ——dp= {ei Bil —18(g+ 1)F] 
p=a r u 
‘ —e-**Fi[ —i8(g—1)h]} 
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L= dp= Ei(—iBhg) 


p=a p 


— Fil—iB(g+1)h]Je‘**— Eil—i8 (g—1)h Je 


«© p»—i2fr 
ki f "p= —3{ Fil — 248 (ro h) Je" 
conte + Fil — 218 (ro—h) ]e-®°) 





c) e2Bp 
I.= dp= — Ei(—2iBa), 


“"p=a «=P 


where 


1 
ro=(2+H)! and g=[(a+(e+ wy. 


The exponential integrals occurring in the above ex- 
pressions are all of imaginary argument so therefore they 
can be expressed in terms of the sine and cosine in- 
tegrals, Si(x) and Ci(x), respectively, as follows 


Bi(—is)=Cita)+4] Sita] 


When the antenna is not terminated such that 
h'=0 or a=8h the expression for AZ simplifies to 


n 
AZ= —_| et - 218 (ro+h) je?" 
4m sin?Bh 


+ Ei{ — 2i8 (ro— h) Je**®*+ 2 cos*BhEi (— 2i8a) 
+4 cosBh{ Ei(—iBhg)— Ei —iBh(g+1) Je*** 


— Bi —iahe— 1] | (16) 


It has been found that the results for AZ can be put 
in a very convenient form by writing 


AZ= AR+iAX 


and then plotting AR as a function of AX for values 
of the ratio a/h. The curves take the form of spirals 
which are somewhat similar to the “impedance spirals” 
of Storer* for the antenna and disk situated in free 
space. 

As a specific example the impedance spiral for a 
quarter wave vertical monopole (i.e. s=X/4) without 
top loading is shown in Fig. 1. Since the displacement 
currents in the ground can be neglected at low radio- 
frequencies (i.e. w<o/e) the curves can be conveniently 
normalized by multiplying AR and AX by (o/f)!. It is 
interesting to note that AR approaches infinity as 
a/h approaches zero. This is not surprising since the 
ground loss would be infinite if the lower end of the 
antenna were in direct contact with a homogeneous 
semi-infinite conductor. 

Further calculations from Eqs. (15) and (16) have 
been carried out in this laboratory by Mr. D. A. 
Trumpler. In addition the effect of the ground screen 
on the radiation field has also been studied. These 
results are the subject of subsequent papers.*” 
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Plastic deformations in beams caused by symmetric dynamic loads distributed over finite Jengths are 
considered in this paper. The analysis is based on the assumption of “plastic-rigid” behavior, so that the 
results should be valid when plastic defcrmations are sufficiently large. The final deformations for dis- 
tributed loads are found not to differ much qualitatively from those to be expected for concentrated loads. 
However, the magnitude of the deformation decreases rapidly as the length of the loaded area increases 
from zero, for given load pulses. Thus the simplifying assumption of a mathematically concentrated load 
may cause a considerable overestimate cf the final deformation. 





1. INTRODUCTION 


N a recent paper' a method was described for calcu- 

lating the permanent deformation of a bar of ductile 
metal subjected to a transverse impact load concen- 
trated at the midpoint of the bar. The basis of the 
method, originally suggested by Prager, is that elastic 
strains are neglected entirely, the bar being assumed 
to be rigid except at cross sections where the limit, or 
fully plastic, moment Mo is maintained. At such sec- 
tions it is assumed that plastic strains of indefinitely 
large magnitude can occur. Since discontinuities in 
slope angle can take place at such sections, while the 
bending moment remains essentially constant there, 
these sections are conveniently termed “plastic hinges.” 
This concept has been found to have valuable applica- 
tions in predicting static loads which produce failure of 
the “plastic collapse” type in civil engineering struc- 
tures. Papers by Baker? and Symonds and Neal* may 
be consulted for descriptions of these applications, and 
for details concerning the significance and calculation 
of the limit moment magnitude. 

Whether or not the concept of localized deformation 
at plastic hinge sections can be usefully applied in 
problems of dynamic loading depends largely on how 
much the energy actually absorbed during plastic 
deformation under a given dynamic load exceeds the 
maximum elastic strain energy which could be stored 
in the structure. A criterion based on this hypothesis 
was presented in reference 1. Experimental investiga- 
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Fic. 1. Type of load distribution assumed in this paper. 



































*The results presented in this paper were obtained in the 
course of research sponsored by the U. S. Office of Naval Research 
under Contract N7onr-35810 with Brown University. 

- a Lee and P. S. Symonds, J. Appl. Mech. 19, No. 3, 308 
? J. F. Baker, J. Inst. Civil Engrs. (London) 27, 188 (1949). 
*P. S. Symonds and B. G. Neal, J. Franklin Inst. 252, No. 5 

383-407 (1951); No. 6, 469-492 (1951). 


tions designed to test the predictions of this so-called 
“plastic-rigid” type of dynamic analysis will be highly 
desirable, and it is hoped that the results of the present 
paper will be helpful in planning and interpreting such 
experimental work. 

It will be particularly desirable, in connection with 
experiments and with practical applications in general, 
to know how certain load characteristics may be ex- 
pected to influence the results. In a previous paper,' 
the influence of characteristics of force pulses was 
investigated, the force being supposed mathematically 
concentrated at the midsection of the beam; as in 
reference 1, the case of a uniform beam with free ends 
was considered. In reference 4 a simple empirical for- 
mula was suggested by means of which the final central 
angle of deformation produced by a concentrated force 
could be estimated from knowledge of the total impulse 
and peak force value; the shape of the force-time curve 
was found to be relatively unimportant. 

The present paper is concerned with the differences 
from the results of reference 4 to be expected when the 
load is distributed over a finite length of the beam rather 
than concentrated at the midsection. In order to obtain 
both qualitative and quantitative comparisons as simply 
as possible, the only form of load distribution curve used 
is that indicated in Fig. 1. The total load P is uniformly 
distributed over a length 2e where e</. Thus the param- 
eter specifying the distribution of the load is the ratio 
e/l=k. 

A previous and first study of this problem has been 
given by Salvadori and DiMaggio. They used a 
smoother but more complicated load distribution func- 
tion, the load intensity being written as 


—e(2z/l) 
PG)= 2it1— ai “) } as 


where P is the total load, x is distance from midsection, 
and c is a concentration parameter which may vary 
from zero (corresponding to a uniform distribution over 
the entire beam) to infinity (corresponding to concen- 





*P. S. Symonds, J. Appl. Mech. 20, 475-481 (1953). 
5M. G. Salvadori and F. DiMaggio, Quart. Appl. Math. 11, 
223-230, (1953). 
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tration of the load at the midsection). The authors 
carried their study only up to the point of showing 
that a sufficiently “spread out” load, characterized by a 
value of the parameter c less than 3.45, would introduce 
a qualitative difference in the behavior of the beam. 
This apparent difference will be discussed later in the 
present paper. With a simpler form of load distribution, 
as used in this paper, it is not difficult to carry the 
investigation further, and to obtain quantitative com- 
parisons between plastic deformations occurring with 
various degrees of localization. These quantitative 
comparisons and more complete qualitative compari- 
sons between final plastic deformations are the main 
purpose of the present paper. The main practical result 
is that a small degree of spreading of the load over a 
finite length of the beam leads to a considerable reduc- 
tion of the total deformation at the midsection. 
Qualitatively, however, the present analysis shows that 
the final shape of the deformed beam will not differ 
radically from that predicted by analyses assuming the 
force to be concentrated at the midsection. 


2. ANALYSIS OF EARLIER STAGES OF DEFORMATION 


Taking the load to be uniformly distributed over the 
length 2e as shown in Fig. 1, we consider the successive 
phases of motion and deformation which can occur. 
We suppose the total load P increases steadily up to a 
maximum value and then decreases to zero, so that force 
pulses of the general shape shown in Fig. 2 are con- 
sidered. 











* : 











le T ais 
(a) (b) 
Fic. 2. Typical force pulse loadings. 





Under these conditions, as the force increases from 
zero, its first effect is to cause a translation of the bar 
parallel to itself. Then all points of the bar have the 
acceleration a= P/2ml, where m is the mass per unit 
length. This initial translation is termed the “first 
phase” of the motion. The constant acceleration has the 
effect of a uniformly distributed loading, and the bend- 
ing moment at any section x [positive bending moments 
defined as in Fig. 3(b) ] is given by 


m(l—x)? 
x>e: M(x)= =. 


(2) 


m(l—x)? P (e—x)? 
x<e: M(x)=—- ; a+— ’ 
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Fic. 3. Effective load components in the second phase. 


where x is measured from the middle of the beam. The 
bending moment in this phase has a minimum at +=0 
given by 


mia Pe Pl Pe 
M (0) = —-——_+—= —-—+—. (2a) 
2 4 4 4 


This first phase ends when the central moment reaches 
the value — Mo, corresponding to a load P; given by 


Pil 4 4 
—= : or pwr=—, (3) 
My 1- e/l 1—k 





where, for convenience, we define P/M =u and e/l=k. 

In the “second phase” of the motion, the two halves 
remain rigid but rotate with respect to each other, as 
in Fig. 4(a). Let ao denote the linear acceleration of the 
center, and ao= d°6)/df denote the angular acceleration 
of the right-hand half (positive clockwise). Assuming 
§) to remain small, the equations of motion are 


P P l é mE co 
—= ml (ao— 31a) ; —(--“)-m- ° (4) 
2 zX2 2 12 
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Fic. 4. Possible types of deformation: (a) second phase; 

ve hinge at mid-point only; (b) third phase with additional 

inges at interior points:in each half-beam; (c) third phase with 
development of finile central plastic region. 



































Solving these for ao and ao, we obtain 


Mo 
ao=—{(2—$k)—6] 
mi? 


Mo 
lag=—{_3u(1—k)— 12). 
ml? 


This second phase continues until the moment at 
some additional cross section reaches the magnitude Mp. 

To see how this may occur, it is helpful to refer to 
typical diagrams of effective load, shear force, and 
bending moment in the right-hand half of the beam. 
The effective load intensity g, with sign defined as 
in Fig. 3(b), can be regarded as made up of three 
parts, as shown in Fig. 3(a); these are the intensity 
P/2e=(M>/?)(u/2k) of the given load distributed 
over the length e, the uniform load intensity —ma(I/2) 
=—(M /P)(u/2) corresponding to the acceleration 
a(//2) of the center of gravity of the half-beam, and 
finally the linearly varying effective load intensity 
corresponding to the angular acceleration ap of the half- 
beam, whose maximum value is mao(Il/2)=M)/? 
<([$u(1—k)—6]. g, V, M curves are shown in Fig. 5. 

Two limiting conditions are evident from the dia- 
grams of Fig. 5. In the first place, unless q(#) is positive, 
the V curve cannot cut the axis in the unloaded posi- 
tion x>e, and hence there can be no positive moment 
in the beam. Hence, in order for it to be possible to 
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develop a plastic hinge at a point where the maximum 
moment reaches the value +Mo, we must have u and 
k related so that 


M 
2u(1—k)—6——>0 (6) 


or 
u(1—3k)>6. (6a) 


Secondly, at the left-hand end of the half-beam, q(0) 
must remain positive ; if it did not, V would have nega- 
tive values in the neighborhood of x=0, and hence 
M(x) would decrease below — Mp in this region. This 
requires the following relation between yz and k: 


rm m 
——~ felt —B)4-6——->0 (7) 
2k 2 
or 


u(3k—1)(1—k) <12k. (7a) 


It will then be evident that the second phase may be 
terminated in some cases because there is a positive 
moment in the interior of the half-beam [as in Fig. 
5(c)_] which reaches the value M, at a sufficiently large 
value of the load; and in other cases because the 
moment in the neighborhood of the midsection reaches 
the negative limit moment value — Mp over a finite 
length of the beam, as in Fig. 4(c). 

The first case, occurrence of a “positive” plastic 
hinge in the interior, is possible only if the inequality (6) 
is satisfied. From (6a), therefore, it is evident that in 
order for this new hinge to form in the interior of each 
half-beam it is necessary to have 


k<Z, (8) 


The second case, of spreading of the central hinge, can 
occur only if inequality (7) is mot satisfied, and hence 
only if 

k>3, (9) 


since inequality (7a) will be satisfied, regardless of yu, 
if k<}. 

In the case of a concentrated force,'* the second 
phase ends when the positive bending moment reaches 
the value Mo, and this always occurs when the load 
parameter uw reaches the value wz;= 22.89. The con- 
siderations of the preceding paragraph show that when 
the load is sufficiently spread out (k> 4), the formation 
of interior positive hinges will not take place in this 
manner. This is the main qualitative difference caused 
by loading over a finite length of the beam. Instead of 
new outboard hinges making their appearance, for 
sufficiently large k values the new effect is the spreading 
of the central hinge, which in the earlier part of the 
motion is always concentrated (according to the basic 
hypotheses of the method) at a single section. 

We now put in more precise terms the conditions for 
(1) occurrence of a positive plastic hinge in the interior 
of the beam, and (2) spreading of the central negative 
plastic hinge. 
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PLASTIC DEFORMATION 


(1) Occurence of lateral plastic hinge: 
For e<x</, the bending moment is given by 


P € MLA, May 
M()=—(2—-)— Me { ; (10) 
2 2 2 6 


This moment will have a stationary maximum at a 
point which can be found by solving the equation 





dM P mMxXa9 
V =—=—— mxayn+—— = 0. (11) 
dx 2 2 
Using Eqs. (5) this yields 
x ML 
=—— (12) 


1 3u(1—k)—12 
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Fic. 5. Typical diagrams of effective load intensity and associated 
shear and bending moment diagrams in second phase. 
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Fic. 6. Plots of u at which the second phase ends as junction 
of k=e/I corresponding to (a) appearance of lateral “positive” 
plastic hinges, and (b) spreading of the central hinge. 


This value of x is put in Eq. (10) to give the expression 
for the maximum moment. Then the value of yw for 
which the second phase ends, is found by setting the 
resulting expression equal to Mo. The following cubic 
equation in wz; and k is obtained 


(3k—2)*ur7°+36(12k?— 18k+7)ur7* 
+423(5k—4)u77+3456=0. (13) 


For any particular value of k, the real root of Eq. (13) 
then gives the value of u for which the lateral hinges 
first appear, on the assumption that the kinematic 
picture of Fig. 4(b) is correct. 
(2) The central hinge must begin to spread if 
?M dV | 


Ax? | ong 9X | cd 


=q(0)=0. (14) 


The critical relation is, therefore, obtained by replacing 
the inequality in (7a) by an equality, which yields 


12k 


ae ae (15) 
(3k—1)(1—k) 


The curves of u as a function of k according to Eqs. 
(13) and (15) are plotted in Fig. 6. It is found that the 
positive branch of the curve from Eq. (15) crosses that 
from Eq. (13) at k=0.361, urr=81.4. We conclude, 
therefore, as follows: 

(1) If k<0.361, the second phase ends with the 
formation of outboard hinges. The initial location of 
these hinges will be found from Eq. (12), and the value 
of the load parameter yu at which they appear is the real 
root of Eq. (13). Plastic regions appear in the manner 
indicated in Fig. 4(b). 

(2) If k>0.361, the termination of the second phase 
will be due to the spreading of the central plastic hinge 
into a finite region in which the moment has the con- 
stant value —Mpo. The load parameter at which this 
effect begins is given by Eq. (14), and the corresponding 
type of deformation is indicated in Fig. 4(c). 

This critical value k= 0.361 corresponds to the critical 
value c= 3.45 found by Salvadori and DiMaggio for the 
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Fic. 7. Free body diagrams for motion in third phase with 
concentrated central hinge. 


smooth load distribution used by them. They refer to 
“splitting of the central hinge” as the effect resulting 
when Eq. (14) is satisfied before the lateral hinges form, 
and they point out that the moment has the constant 
magnitude Mp in the central portion of the beam. The 
use of the term “spreading” seems preferable to “split- 
ting,”’ which might be taken to imply the division of the 
original central hinge into two distinct hinges separated 
by a region moving as a rigid body. A true splitting of 
this sort could occur if the intensity of the distributed 
load has a relative minimum at the center of the beam, 
but not otherwise. 

For the analysis of the miotion after the close of the 
second phase we assume first that the load distribution 
parameter k=e/} is less than 0.361 and then that k 
exceeds 0.361. 


3. ANALYSIS FOR k<0.361 


In this case the lateral hinges appear while the cen- 
tral hinge is still localized at the center section. The 
appropriate free body diagrams for the right-hand half 
are shown in Fig. 7, the distance of the lateral hinge 
from the center being denoted by x,. It is assumed that 
x,>e. The equations of motion, as in references 1 and 
4, are 





P 

F = man (ao— 3n000), (16a) 

Pfx, e mx3a9 

—(=-<)-2m-- (16b) 

Se 2 12 

0= m(I—x,)[ai+3 (/—axa)ar], (17a) 
(17b) 


Mo= —m (l— Xp )axy. 
12 


The fifth equation relating the five unknowns do, 4), 
ao, @1, and x, is that which expresses the discontinuity 
of linear acceleration at the hinge location x,, which is, 
in general, a function of time. This is (see references 1 
and 4) 


dx», 
ao— auare—[ertan(!— 21) J=— (wean), (18) 


where wo, w; are the angular velocities of the segments 
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to the left and to the right, respectively, of the hinge, i.e. , 


dw dw 
e=-, qre-. (19) 
dt dt 
Equations (16)-(18) yield the following system of 
equations: 











ca (20a) 
=—= &—k)—8 ], 0a 
ws dt mb? 
dw; 12M, 
= — = -—_____, (20b) 
dt mB(1—¢£)§ 
My 
do= [u(4E—3k)— 24], (20c) 
2mP 2 
6M, 
ab wnceepeemevenaneio, (20d) 
ml? (1—£)? 
Mofu 3uk 12 6 dé 
= pains |--Se-on, (20e) 
mPLE 22 # (1-—é) dt 
where 
Xap 
t=—. (21) 


These equations hold so long as the bending moment 
has a magnitude less than Mo everywhere except at the 
sections x=0 and x=x,. The effective load, shear, and 
moment diagrams for the segments to the left and right 
of the moving hinge, (Fig. 8) indicate that the critical 
condition will be a spreading of the central hinge. This 
will occur when the following condition is satisfied : 


’M dV 
dx? 


z=0 dx 





=4(0)=0. (22) 


z=0 








From Eq. (16) this leads to the condition 
24k 
B= 
(¢—k) (3k—£) 


The use of the limiting condition requires that Eqs. (20) 
first be solved, yielding (among other quantities), as a 
function of uw; these solutions will be valid as long as u 
is less than the value computed from Eq. (23). 

If any given finite force pulse is replaced by a rec- 
tangular pulse of the same peak force and the same area, 
i.e., total impulse, the final deformations computed fo 
the rectangular pulse will exceed the true ones for the 
original pulse, but for many purposes the results will 
be useful estimates (e.g., in error by 15 or 20 percent) 
(see reference 4). Therefore, to obtain quantitative com- 
parisons of total deformations with various values of k, 
we have assumed a rectangular force pulse shape, Fig. 
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PLASTIC DEFORMATION 


2(b). Since the details of the method of analysis exactly 
parallel those in reference 4, only the final results are 
shown in Fig. 9, where curves of the deformation 
parameters ml°@,/M oT? are plotted as functions of 
Uum=P,l/Mo for a series of values of the distribution 
parameter k. These results and others are discussed 
further in the final section of the paper. 


4. ANALYSIS FOR k> 0.361 


In this case the central hinge begins to spread before 
lateral hinges appear. At time ¢ the central hinge will 
have spread out a distance d from the center; in the 
region —d<x<d the moment is constant at the value 
—Mp». The equations of motion for the segment of 
length /—d are obtained from the free body diagram of 
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Fic. 8. Diagrams of effective load, shear, and bending moment in 
the third phase, with concentrated central hinge. 
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Fic. 9. Final central angle of deformation caused by various 
values of the load distribution parameter &, as function of maxi- 
mum load P of a rectangular force pulse. 


Fig. 10, and are found to be 


P 

=e hme the ~ 30 Bw) (24a) 
é 

P l—d e—d 1 

—-a|—— —|- Mo= —m (l—d)8ay’, (24b) 

2e 2 2 12 


where dy’ is the acceleration at x=d and ay’ is the angu- 
lar acceleration of the segment of length /—d. 

Equations (24) yield expressions for the accelerations 
as follows: 


, 


P “ (k-d) (4—r—38)—6| (25a) 


: oT 


12M> [x 
oe =| e-r)1-#)-1], 
mB (1—))*L4k 


where 


(25b) 


(25c) 


~)e& 


To determine d, we use the fact that the central hinge 
spreads at a rate just sufficient to prevent the moment 
near x=d from decreasing below the value —Mp. 
The condition is that 


aM dV 
dx? 


P 
=—| =q(d)=——ma'=0. (26) 
red dx 2 
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Fic. 10. Free body diagram of segment to right 
of central plastic region. 








Using Eq. (25a) and simplifying, it is found that A is 
the following function of u and k: 


u(3k—1)(1—k) —12k 
A= sane, 
2u(1—k) 





(27) 


Plots of u vs X for various values of & are given in 
Fig. (11). Thus for k>0.361 the critical portion of the 
beam (—x<X/<x) has the constant moment — Mo, so 
that the accelerations of all elements are proportional 
to the local load intensity P/2e. The two segments of 
length /—d=/(1—A) rotate as if rigid bars hinged at 
the sections x= +d [Fig. 4(c) ]. So long as the moment 
in the outer segments remains less than Mo in magni- 
tude, this is the correct picture. The load magnitude at 
which this picture ceases to be the correct one is that at 
which new “positive” hinges form in the interior of the 
outer segments. The maximum bending moment in 
these segments can be investigated by the same method 
as was used in the case of a localized central hinge. 
The following equation was obtained by setting the 
maximum bending moment in one segment d<x</ 
equal to Mo. It is analogous to Eq. (16) but depends 
now upon A= d/l in addition to k. 


pat edd)" (RD) 2+2— 3k) — 128] 
4k[3u(k—X) (1—k) — 12k} 
w(k—r? —wh(k—2)8(1—-D)* 
4k ORE 3u(k—d)(1—k) — 12k P 








To determine the load at which the additional hinges 
first appear, the above equation must be solved simul- 
taneously with Eq. (27). Alternatively, for any chosen 
value of k (>0.361), plots of u vs \ from Eq. (28) and 
Eq. (27) can be drawn, and the intersection of the two 
curves yields the initial value of u in question. As one 
example, the case k=0.5 has been worked out. In this 
case, when the two equations are combined we obtain 


ui— 192u?+ 1152u?— 110592=0. (29) 


This has the solution .= 186, corresponding to A= 0.242; 
the new hinge appears at x«=0.660I. 


5. NUMERICAL RESULTS AND CONCLUSIONS 


The type of load distribution considered here, with 
uniform load intensity over part of the beam and zero 
load over the rest, is intended less as a realistic loading 
than as the simplest type of distributed load to handle 


SEILER AND P. S. 


SYMONDS 


analytically. It seems intuitively clear that the qualita- 
tive results deduced from this simple case will apply to 
general cases of loads spread over finite areas. The main 
qualitative result is, in fact, that the general picture of 
the final deformation is exactly the same as that in the 
case of concentrated loading unless the length over 
which the load is applied exceeds about a third of the 
total length of the beam. Only when k exceeds the value 
0.361 does a qualitative difference appear. This consists 
of a spreading of the central hinge, so that instead of a 
plastic hinge, localized at the mid-section there is a 
finite length of the beam in which the bending moment 
has the magnitude Mo. However in real beams a hinge 
cannot be localized at a single cross section in any case 
since this would imply infinite strains; the plastic region 
is always spread over a finite region even when the load 
is strongly localized. Thus the final plastic deformation 
of a beam subjected to a sufficiently large distributed 
load will not in any case differ radically from that pre- 
dicted by an analysis using a mathematically concen- 
trated load. 

As far as quantitative comparisons are concerned it is 
of particular interest to investigate how a small degree 
of load spreading modifies the magnitude of the final 
deformation predicted by the simpler analysis for a 
concentrated force, since in practice loads are always 
applied over finite areas. 

To provide information of this kind calculations have 
been made of the final central angle @) produced by a 
series of loads with k values ranging from zero to 0.30. 
The force-time curve was taken, for maximum sim- 
plicity, to have a square-wave shape, as in Fig. 2(b). 
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Fic. 11. Dependence of width d of central hinge on load P for 

various values of load distribution parameter k=e/I. 
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PLASTIC DEFORMATION 


For this type of force pulse the lateral hinges move 
instantaneously to positions which they maintain so 
long as the load remains constant. The lateral hinge 
coordinate £ is found as the solution of the following 
equation: 





mw Suk 12 6 
+ = (). 
¢ 2° # (i-—&) 


This is Eq. (20e) with the right-hand side set equal to 
zero. Figure 12 shows curves of & plotted against u for 
values of k ranging from zero up to 0.30. The calculation 
of the further increments of deformation which occur 
after the load drops to zero are made in the manner 
described in reference 4. 

Figure 9 shows curves of the dimensionless param- 
eters ml°0o/M oT? plotted as a function of the dimension- 


(30) 





om] 








Fic. 12. Coordinate of lateral hinge x, for various values of load 
distribution parameter k, for constant load P. 


less load parameter m= Pm//Mo, for k equal to 0, 0.10, 
0.20, and 0.30. Figure 13 shows curves of ml*@o/M oT? 
plotted against & for various values of u,. These curves 
show that when the maximum load is large (e.g., 
Um> 40) the deformation falls off very rapidly. For ex- 
ample, with u,=40, the deformation parameter drops 
from about 800 for a concentrated load (k=0) to about 
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Fic. 13. Final central angle of deformation as function 
of load distribution parameter &. 








600 for k=0.05. This apparently large reduction caused 
by the distribution of the load over a finite length may 
be somewhat exaggerated, as compared with practical 
cases, because of the special conditions of the present 
calculation, i.e., the assumption of a rectangular force 
pulse and of a constant load intensity over the loaded 
segment of the beam. However, the results indicate 
that the assumption of a mathematically concentrated 
load may lead to an appreciable overestimate of the 
deformation, since the actual load is spread over a 
small but finite length of the beam. This result should 
be of significance in evaluating experimental results. 
It should perhaps also be noted that strain-hardening 
will cause a further reduction in the permanent defor- 
mation ; this effect will presumably be more important 
the higher the degree of concentration of the load. 
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That the noise convection current in a diode would be the same as the noise current in a triode under the 
conditions @,>>1 and 6:1 (@, and 62 are transit angles in radians of the potential minimum-to-grid and grid- 
to-plate regions, respectively) is shown by following the approach indicated by Rack and applying the 
Llewellyn-Peterson small signal theory to study the noise in a triode. Although the general trend of the 
experimental results agrees very well with the prediction from the Rack approach, the measured value of 
noise power is found to be about 2 db below the predicted value. The experimental results are applicable to 


triodes as well under certain conditions. 





I. INTRODUCTION 


N recent years many workers'~* have studied the 
signal and noise behavior of high-frequency tubes 
on the assumption that the velocity of electrons is 
single valued thus neglecting the effect of the thermal 
velocity distribution. In general, analyses such as the 
above have been quite successful in predicting the ob- 
served results. The effect of transit time on shot noise 
in vacuum tubes is studied in the present investigation 
by the familiar Rack-Llewellyn-Peterson method.!:® 
The purpose of this paper is to investigate the behavior 
of noise convection current in a diode and electron 
streams under large transit time conditions. It may be 
remarked here, however, that the present results are 
applicable to triodes as well under certain conditions. 
The Rack-Llewellyn-Petersonf method employs a 
set of equations for a diode region obtained by Llewel- 
lyn.’ Llewellyn and Peterson’ have shown that the ac 
behavior of a diode-type region can be described by 
specifying certain boundary conditions at one of the 
two planes bounding the diode region. The assumptions 
made in the above analysis are given elsewheref and 
will not be repeated here. The R-L-P method essen- 
tially consists of the following: first, the spectral den- 


sity of the mean-square fluctuations of the average » 


velocity of the electrons at the potential minimum is 
obtained; then, instead of considering a multivelocity 
electron stream from the potential minimum on, a 
single velocity stream having the same mean-square 
fluctuations is considered. A procedure such as Rack’s 


* Based on a portion of a dissertation submitted by the author 
to the University of California. Reference is made to this dis- 
sertation for complete details. 
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pany, Inc., New York, 1950), Chap. X. 

*L. D. Smullin, Tech. Report No. 142, Research Laboratory 
of Electronics, Massachusetts Institute of Technology (1949). 

5D. A. Watkins, Proc. Inst. Radio Engrs. 40, 65 (1952). 

6 A. J. Rack, Bell System Tech. J. 17, 592 (1938). 

t From now on we shall use the abbreviation R-L-P method for 
Rack-Llewellyn-Peterson method. 

7F. B. Llewellyn, Electron Inertia Effects (Cambridge Uni- 
versity Press, Cambridge, 1941). 

t See, for instance, reference 2. 


which is based upon a hypothesis must be supported 
by experimental results or more detailed analysis. 


In recent years, experimental investigations have 


been carried out by Duvall,* Kompfner, ef al.,° and 
Lavoo” on noise in diodes under fairly large transit 
time conditions, and by Cutler and Quate," Watkins,' 
and others regarding noise in beam-type tubes. 


Il. ANALYSIS 


Let us investigate a triode and find how the noise 
will vary with the cathode-to-grid transit angle. The 
region between the cathode and the potential minimum 
is neglected since otherwise this would contradict the 
assumption that the velocity of the electron stream is 
single valued. Only space-charge limited emission is 
considered below since this is of greatest interest in 
practice. 

Consider a parallel-plane space-charge limited diode 
region whose entrance plane a is taken to be just beyond 
the potential minimum and whose exit plane g is 
infinitesimally close to the actual position of the grid 
wires. Considering a small frequency interval of width 
Af, one can write the following electronics equations ‘§ 


I,=a(V.— V a) +G12Gat G13, 
Jo= 421(Vg— V a) +422Gat+ 2300 (1) 
Vg= 031(Vg— Va) +G32GatGsi0a. 


The coefficients a@;, @j2°++d@33; are functions of fre- 
quency and space charge, but, if the frequency interval 
of the measuring apparatus is small, they can be con- 
sidered to be constant throughout the range of fre- 
quencies between f and f+A/. (V,—V,) is the fluctua- 
tion in potential across the diode, J is the fluctuation 
in total current (convection plus displacement currents) 
passing through the diode region a—g, qa, Va and qy, % 
are fluctuations in the convection current density and 
velocity at the planes a and g respectively (see Fig. 

8G. E. Duvall, Tech. Report No. 82, Research Laboratory of 
Electronics, Massachusetts Institute of Technology (1948). 

*K. Kompfner, et al., J. Inst. Elec. Engrs. (London) 93, Pt. 
3A, No. 9 (March-May 1946). 

1 N. T. Lavoo, Proc. Inst. Radio Engrs. 37, 383 (1949). 

1 C. C. Cutler and C. F. Quate, Phys. Rev. 80, 875 (1950). 


§ The Peterson form of the electronics equations is used here, 
see for instance reference 2. 
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1). Complete space charge in the cathode-to-grid region 
is assumed in the following treatment. According to 
the notation of Llewellyn and Peterson, the space- 
charge factor|| :=1 for the complete space-charge 
limitation. Also the dc velocity u_ at the potential 
minimum is taken to be zero. 

Using the above assumptions, one obtains for region 
(1) the following] from Eq. (1): 


I,g=an™(V,— Va) +413 (vs)a 
Qo= 421" (Vg—Va) +425" (vs) (2) 
Vg= a3," (V.— Va)+a33 (vs)a. 


Note from Eqs. (1) and (2), that if a small amount of 
initial fluctuation current is assumed to exist at the 
plane a the contribution from this fluctuation is negli- 
gible in comparison with that due to a fluctuation in 
velocity at the plane a. Now, treating again the grid- 
to-plate region as another diode we can write: 


[,= an (V>— Vg) +12 Largg tN’ ]+413° 0, 
Qp= au” (Vp—Vg)+a22L[arggtN’]+a23, (3) 
p= (Vyp—Vo)-+aa2™Largg +N’ ]+-a020, 


where (1—a;) is called the differential capture fraction 
of the grid, owing to the fact that the grid may be at a 
positive potential (see Fig. 1). Also, a new fluctuation 
N’ arises from the capture of electrons by the grid. 
The fluctuation N’ is introduced above to include the 
effect of partition noise which will be considered later. 

To find the total fluctuation in the plate current, it is 
necessary to know the value of J,. The problem now is 
divided into two sections, one on the negative grid 
case and the other on the positive grid case. 


(a) Negative Grid Case 
In the case of a tube with negative grid, the value of 
a, above is equal to unity and N’=0. From Eq. (2) 
and Eq. (3) we obtain 
Tp=an (Vep—Va)+ {012 a1 +413 a1} 
XK (Vo—Va)t (ap) af aie Ge3 + 41333}. (4) 








REGION 
GRID, PLATE 
| 
| | 
| | 
| REGION 
| f | 
| a—p> —Pag 

| 
| 
| | 
CATHODE Va: | 
Q q r) 
POTENTIAL 
MINIMUM 
Fic. 1. 


|| For definition of the space-charge factor, see, for instance, 
reference 1. 

{ Superscripts of the coefficients ai, @12°-+@33 indicate the 
region concerned. 


TRANSIT-TIME CONDITIONS 565 








One can then draw the general equivalent circuit for a 
triode with a finite u on a node basis as shown in Fig. 2, 
where Vg is the potential of the grid wires and Z is an 
impedance element connected between the actual grid 
wires and the grid plane in the electronic analysis. 
Z reduces to a pure capacitance when the grid is at a 
negative potential. The value of the capacitance agrees 
with that given by Maxwell’s screening formula in the 
absence of electrons.’ Other methods of calculating the 
equivalent grid potential have been used by North” 
and Zuhrt." If the tube has an infinite u (infinite ampli- 
fication factor), i.e., if the regions (1) and (2) are com- 
pletely shielded from each other, then the branch com- 
posed of Z and the current generator disappears from 
the above equivalent circuit. 

In Fig. 2, Yoo is the self-admittance of region (2), 
Y 12 is the mutual admittance between regions (1) and 
(2), and VY, is the self-admittance of region (1). That is: 


V22= a1, 
Yy2= {d12 do. +4331} (5) 
Yu=ay™. 


The corresponding value of noise current in the plate 
circuit owing to a fluctuation in velocity (v,), at the 
plane a is i, given by 


in= (vp) a{ 12 G23 4-413 433}. (6) 


As is usually done, we assume that the region (2) is 
space-charge free, and that £.—0. Hence, 


tn— (vs) a (G12 G23?) 
and (7) 
01271 +++ for | B2|<1. 


Where 62= j02= jwT>2 for region (2), 62 is the transit 
angle of region (2), T2 being the transit time of region 
(2). Substituting now the above result, we obtain for 
noise current in the plate circuit, i,2a23" (vy), using 
the proper value of dc current in a3") that is 


(in?) w= | @23™ |*( (07) 0)nv- (8) 


One now uses the noise velocity of Rack® given by 


((v/)?a)w= (AnkT./eT 0) (1-2/4) Af, (9) 


2 ED. O. North, Proc. Inst. Radio Engrs. 24, 108 (1936). 
18H. Zuhrt, Hochfreq. u. elek. 29, 73 (1937). 
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&m=1/r,= (12€/TonT 1‘), (10) 


where n=e/m for an electron, J»>=cathode current, 
T.=temp. of cathode, k=Boltzmann constant, Af 
=band width of measuring apparatus, e=dielectric 
constant of free space, and 7,=transit time of region 
(1). We obtain, 


(in? )w=S(0:)[3 (1-2/4) 4kT gm Af]. (11) 


A plot of S(@:) occurring in Eq. (11) is made in Fig. 3 
(reproduced from a private communication from J. R. 
Whinnery). 

The following comments regarding the noise current 
generator in the input circuit are in order: 


ini=@13"(vs)a, gives the noise current at the grid 
due to a velocity fluctuation at plane a. 


(int) w= | a3" 12a ya?) v= 3(1 —m/4)4kT 710A fG(O;) (12) 
where 
(13) 


and ¢y=de cond ctanrce of rerion (1). G(6;) gives the 
variation of noise current in the diode rezion with 
transit angle @,. In Fq. (12) one sould vse &:=1 to 
correspond to the cise Uf a divde r the cithode to 
grid region in a triode. 


Evaluation of (in?) for Small Values of 0, 
ay21--- for |B.|K1. 


G 6;) = (4 6,)\f2+0/— 2(c s6,+0, Si 19;) | 


We have to evaluate a2;") for small values of #; 
23 (2€/nT 7) [Bie "+3 ] 
neglecting terms involving powers >2 of 6). Then, 
(in? )wO( (07) <2) (4€°/9°T 14) (9+-6,?). (15) 
Writing T?= (3/eZo)(2kT -8m(1—7/4)) and using Eq. (9) 


(14) 


** Assuming initial average velocity is small at the potential 
minimum, the low-frequency plate impedance for a diode (in our 
case, the region between cathode and grid) can be written as 
shown. 


YADAVALLI 


and Eq. (10), Eq. (15) reduces to 


(in?) w2E2el oA fT? (1+6,7/g) (16) 


(17) 


that is, the limiting value of (i,?),, for small values of 
6, is the same as in the case of a diode whose transit 
angle is negligible. This is predictable anyway since 
the effect of the grid-to-plate region was completely 
neglected. 


for 
0:0, (in?) wO=T?2eD Af, 


Evaluation of (in?) for Large Values of 0, 
As in the previous case for |82|<1, we obtain 
i n==do3" (vs)a 


@23° PS (2e/nT :*) Bie) is for |8,|>>1. (18) 


As before we write expressions for ((v,)a”)w, gm, and 
I’; after substitution and rearrangement of terms, we 
obtain 

(in?) wo 2el oA fT?F (01) (19) 


where F(0,)=07/9. A plot of F(@:) of Eq. (19) is 
shown in Fig. 3 by a dotted line. Hence, the asymptotic 
behavior is that the noise power in the plate circuit of 
a triode varies as the square of the cathode-to-grid 
transit angle, provided the transit angle between the 
grid and the plate is small in comparison with the 
cathode-to-grid transit angle. 


(b) Positive Grid Case: Partition Noisett 


We use the same Llewellyn-Peterson equations for 
the equivalent diodes representing the cathode-to-grid 
and grid-to-plate regions in the following analysis. 


T,=ay° (Vp—Vg)+ (aie ayaa, +443 a3,} 
x (V4— Vat (v,)af dy. a2, ay 
+443 a33} +ayON’, (20) 


As has been stated, the fluctuation .Y’ is introduced in 
Eq. (20) to include the effect of partition noise, and we 
recall that the excess of noise in an electrode over the 
simple apportioning of cithode-current fluctuations is 
terme! partition noise. 

As hefore, concerning ourselves with the case |8.|<1 
and |6,|>>1, we obtain 


in {arrd23" (vs)a+N’} 
or (21) 
(in?) wool ?2el pA f(0:°/9)+((N’)?) av, 


where I? and other quantities have the same significance 
as in the negative grid case. NV’ is obtained from the 
expression{{ 


((N’)*) w= 2ea1(1—an)I p, (22) 


tt The effect of secondary electrons is not considered in this 
paper. However, this is under investigation at present. ° 

tt This expression is obtained on the basis that the effect of 
secondary electrons can be neglected, and that the grid is so fine 
that the probability of an electron hitting it is the same, regard- 
less of the position on the cathode from which it leaves. 
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where Jp is the dc current in the region preceding the 
grid. It can be seen above that as a;—1, N’—0, meaning 
that under the above circumstances, the effect of posi- 
tive grid is negligible. 


Ill. NOISE CONVECTION CURRENT IN A DIODE 
AND NOISE IN ELECTRON STREAMS 


One can see from Eq. (2) that the component of 
noise convection current at the exit plane of a diode 
region due to a fluctuation (vs), in the velocity at the 
entrance plane is given by 


(23) 


Hence, one can say that any investigation regarding 
the calculation of noise current in a triode under the 
conditions @,>1, and @.<1, and the experimental 
verification of the calculated result is one involving the 
noise convection current in a diode due to an initial 
velocity fluctuation at the entrance plane. Then, be- 
cause of the above reason, we say that the present in- 
vestigation is concerned with the measurement of noise 
in electron streams. 


Qg= 423" (v4). 


IV. EXPERIMENTAL TECHNIQUE AND RESULTS 


For the present investigation the experimental tubes 
were designed with the following characteristics in 
mind: 


(a) large transit angle of the cathode-to-grid region, 

(b) small transit angle of the grid-to-plate region, and 

(c) rectilinear flow of electrons (in the present case 
the beam is cylindrical). This is essential since it is 
desired to verify the results obtained from the one- 
dimensional theory of Llewellyn and Peterson. 


Transit angle of any region can be varied at any 
frequency by varying the distance between the corre- 
sponding electrodes, other conditions remaining the 
same. It can also be seen that it is convenient to make 
measurements at a low enough frequency to reduce the 
problems of circuitry; but at the same time the fre- 
quency should not be too low since the tube becomes 
unusually large, and one would have to face problems 
regarding emission and maximum current that can be 
drawn to ensure space-charge limited operation in the 
cathode-to-grid region. An electrode configuration that 
was chosen to approach the desired characteristics is 
shown in Fig. 4. It may be mentioned here that even 
though in the experimental tubes of the present in- 
vestigation an axial beam of finite lateral extent is 
produced by a Pierce" type electrode structure, such an 
approximation is valid considering the low-current 
densities at which the tube was operated to obtain 
large transit angles. 

The transit angle is a measure of the importance of 
space-charge waves. For the type of model chosen, the 
space-charge noise and velocity waves are taken into 


“J. R. Pierce, J. Appl. Phys. 11, 548 (1940). 
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account properly through the use of Llewellyn-Peterson 
equations. 

Also, considerable difficulty regarding the operation 
of initial experimental tubes has been experienced due 
to charging of the glass walls, which was circumvented 
later by using a better focusing electrode and a larger 
diameter glass envelope. 

To obtain the variation of y*§$§ with transit angle, 
one can make measurements either at a number of fre- 
quencies, or with various values of dc current flowing 
through the tube at a fixed frequency. In the present 
investigation the second alternative was chosen, except 
that measurements were made at two different fre- 
quencies, 34.6 mc/sec and 69.2 mc/sec. To measure 7” 
the noise generated in the experimental tube is com- 
pared with the noise from a saturated diode for which 
the transit time is negligible. It may be mentioned here 
that the external cathode-grid impedance is very small 
and consequently can be neglected, since the tube under 
consideration has a very low transadmittance which can 
be readily checked by calculation. One important 
feature about the measuring system used in the present 
investigation is that it has rather narrow band widths; 
this is very desirable since, if one obtains any informa- 
tion regarding y*, it will be at a particular value of 
transit angle, or more precisely at the average transit 
angle over a small range of frequencies. 

It can be seen from Section II of this paper that T? 
is a function of the operating conditions. Hence, any 
computations made in the determination of F(@,) in- 
volves a fair amount of algebra, since the various values 
of 71, gm, and I? would have to be computed. The ex- 
perimental curve for F(6,) vs 6; is drawn on a db scale 
in Fig. 5 for a negative grid case. We now recall that 
as 0,0, the triode under investigation would behave 
like a diode whose transit angle is small. The absolute 
value of noise at low frequency (6; small) has been 
measured which serves as a check point. Since only a 
segment (not the whole) of the curve of F (6) vs 6; can 
be obtained by measurements at either of the two fre- 


§§ Y=IF (6). 
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J 


quencies, at least one check point is necessary to con- 
nect the two segments of the curve. In the present 
investigation, actually two such check points were ob- 
tained. It can be seen from Fig. 5 that the experimental 
curve approaches asymptotically a value which is about 
2.0 db below the theoretical value predicted by the 
Rack approach. 

Proceeding then to the positive grid!!! case, one can 
write Eq. (21) in the following form: 


Io 





Git n= Dele f] 1 (1-19) |-2etoasree' on), 


0 g 


where Jo+/, is the cathode current, Jo the plate current, 
I, the grid current, and l',*=I?F(6,). Hence, using the 
experimental values of F(6,) for the negative grid case 
one can obtain the predicted values of F’(0,) for the 
positive grid case. Figure 6 presents both the experi- 
mental and predicted curves. 

Investigations have been made regarding the con- 
vergence and divergence of the beam by changing the 
potential of the focusing electrode. As expected, it was 
found that the noise in a converging beam is more, and 
in a diverging beam less, than in the case of a beam with 
rectilinear flow. Also, preliminary calculations made 
using the space-charge wave theory indicate that the 
finite diameter of the beam explains a reduction of 
noise of about 1 db at large transit angles (~12 radians), 
although similar reduction at lower transit angles is 
not explained. This is probably owing to the lack of 


\||| The value of @; in the case when the grid is at a positive po- 
tential is computed by treating the cathode-to-grid region as a 
diode whose dc current is (Jo+/,). 
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validity of the assumptions made in space-charge wave 


theory under large current conditions. 


V. CONCLUSION 


It can be seen from the above that the general agree- 
ment between the theoretical predictions and experi- 
mental results is good. Hence, this proves that a single- 
valued velocity theory is powerful enough to predict 
many of the characteristics of a vacuum tube, under 
small signal conditions. 

There is reason to believe that the main weakness 
of the theory is its assumption of single-valued velocity 
for the electrons. Hence, much work has to be done in 
this direction before the problem is completely under- 
stood. The modification of the Rack approach sug- 
gested by Robinson’ does not seem to be applicable 
in the present investigation. It appears, however, that 
the finite diameter of the beam does play an important 
role in the foregoing study, and is being investigated 
at present in greater detail. 
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16 F, N. H. Robinson, Phil. Mag. 43, 863 (1950). 
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Supersonic liquid jets were produced in air by means of a spring-loaded injector. At high jet velocities 
another type of breakup seems to occur besides the classical Rayleigh surface tension breakup, and the 
sinuous aerodynamic breakup. Rotationally symmetric waves are formed, and appear to “break” analogous 
to wind-produced waves on a body of water. The rate of cavitation of jets in water was studied for several 


jet velocities above the speed of sound in air. 





INTRODUCTION 


S early as 1878, Rayleigh! discussed the instability 

of liquid jets. In his treatment, the jet was assumed 
to be a cylinder of nonviscous fluid along which rota- 
tionally symmetric waves could be propagated by sur- 
face tension restoring forces. Since the surface area of 
the jet is not minimal for the cylindrical form, but for 
droplets of a certain radius, the cylindrical form is 
unstable. Rayleigh found that if the wavelength of a 
disturbance was greater than the circumference of the 
jet, the amplitude of the disturbance increased expo- 
nentially with time, and that the wavelength of maxi- 
mum instability was 1.4 times the circumference. 

Weber’ and Haenlein® studied another type of breakup 
in jets which occurs at velocities on the order of 80 
m/sec, where air friction forces become more pro- 
nounced. The effect of air friction was to modify the 
Rayleigh breakup, and also give rise to unstable sinuous 
vibrations of the liquid column. Since parts of the jet 
which are at greater radial distances from the original 
column axis have aerodynamic forces which tend to 
increase this distance, the oscillation amplitude in- 
creases until the jet is sheared. 

With velocities close to and exceeding the speed of 
sound in air, it appears that another type of breakup 
may occur. This phenomenon is the subject of the first 
part of this report. During the recent war, considerable 
interest was aroused in the theory of liquid jets by the 
development of metal-shaped charge jets produced by 
conical liners in high explosive cavities. Because of the 
high velocities obtained (about 10‘ m/sec) the inter- 
action of a jet with target material was assumed, in 
the simple theory, to be comparable to a liquid jet 
striking a liquid target. In this sense, the second part 
of this article can be considered somewhat analogous 
to the metal jet penetration of metal targets. 


* This paper is based on work performed under Contract No. 
AT-04-1-GEN-12 between the U. S. Atomic Energy Commission 
and the University of California at Los Angeles. 

1 Lord Rayleigh, Proc. Lond. Math. Soc. 10, 7 (1878). 

2 C. Weber, Z. angew. Math. u. Mech. 2, No. 3 (1931). 

3A. Haenlein, Forshung Gebiete Ingenierw. 2, No. 4 (1931). 
Natl. Advisory Comm. Aeronaut. Tech. Mem. 659. 

‘ Birkhoff, MacDougall, Pugh, and Taylor, J. Appl. Phys. 19, 
563 (1948). 


EXPERIMENTAL PROCEDURE 


The high-speed jets were produced by an injector 
used in biological applications which is manufactured 
by the R. P. Scherer Corporation, of Detroit, Michigan. 
The mechanism is essentially a spring-loaded plunger 
which forces a rubber stopper into one end of a small, 
drawn, stainless-steel capsule, the other end having a 
small hole, through which the jet emerges. The cross 
sections of the two nozzle shapes used in these experi- 
ments are shown in Fig. 1(a); the two hole diameters 
used were 0.13 mm and 0.20 mm. 

The photographic equipment consisted of two 0.25 
microsecond-duration Libessart sparks with a variable 
electronic delay between them consisting of a multi- 
vibrator circuit. The experimental arrangement is 
shown in Fig. 1(b). The jet was shot vertically down- 
ward into a small box which contained a photographic 
plate and a variable width slit facing the spark sources. 
This arrangement allowed two successive frames to be 
taken on the same photographic plate, from 10 to 120 


microseconds apart. 
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Nozzle: 0O+l3 mm (A) 
Velocity: 60 m/sec 

d: 0 (a) 
Scale: eam, 


Nozzle: 0-20 mm (B) 
Velocity: 450 m/sec 

d: 1 cm (b) 
Scale: ney 

Nozzle: 0-13 mm (A) 
Velocity: LS m/sec 

da: 0 (c) 
Scale: et LF 

Nozzle: 0-13 mm (B) 


Velocity: 170 m/sec 
d: 1.3 cm (d) 
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Nozzle: 0-20 mm (2B) 
Velocity: 60 m/sec, 295 m/sec 
d: 1 em (e) 
Scale: 4 2mm 








Fic. 2. Supersonic and subsonic jets. “d” is the distance from the left-hand side of the photograph to the nozzle. 


In order to increase the opacity of the jet for the 
shadowgraph technique, the fluid used was an aqueous 
solution of methylene blue (2.4 grams per liter) with a 
surface tension approximately ten percent higher than 
that of distilled water. There was no apparent difference 
in behavior between jets of distilled water and jets of 
the methylene blue solution, however. The jets were 
injected into air at atmospheric pressure. 


SUPERSONIC JETS IN AIR 


Figure 2(a) is a shadowgraph of a jet from a 0.13-mm 
Type A nozzle illustrating a type of breakup which 
sometimes occurs at high jet speeds. ‘‘A”’ is the leading 
wave accompanied by a detached head and a tail shock 
wave. ‘“B” is another wave which appears to be par- 
tially shielded aerodynamically by “A.” At “C,” a third 
wave is just beginning to form. The wave at “‘A”’ seems 
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to be “breaking” analogously to waves on an open 
body of water under high wind conditions. The de- 
tached head shock waves of waves “A” and “B” 
appear to be weak in the vicinity of the jet axis, indi- 
cating the possibility of a boundary layer near the jet, 
which could be caused by the initial part of the jet 
accelerating the air in this region. The speed of this jet 
as measured by the shock wave angles is about 460 
m/sec. The spray trail from wave “‘A” was observed on 
the negative to extend back as far as wave “B.” The 
breaking waves are apparently not caused by the initial 
pressure transients alone, since they have been ob- 
served in later parts of jets in several photographs, 
of which Fig. 2(b) is an example. This jet is from a 0.20- 
mm Type B nozzle, and has a velocity of approximately 
450 m/sec. The Reynolds number at the jet orifice is 
about 9X10; on this basis the fluid might be assumed 
to be highly unstable to turbulent flow. 

The second frame of Fig. 2(c) shows another wave 
accompanied by head and tail shock waves. This jet, 
from a 0.13-mm Type A nozzle, is accelerating, since 
there are violent collisions evidenced in the second 
frame, and there are no shock waves visible in the first 
frame; the diagonal line on the first frame of Fig. 2(c) 
is a scratch on the negative. The velocity of the wave is 
approximately 445 m/sec. 

The breaking waves can be seen on subsonic jets, 
but the form of the wave is not so definite as in the 
supersonic case. Figure 2(d) shows two successive 
frames of a subsonic jet from a 0.13-mm Type B nozzle 




















Fic. 3. Supersonic jet cavitation in water. 
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Fic. 4. Jet cavitation velocity (U) versus jet velocity (V). 


with a velocity of approximately 170 m/sec. From this 
photograph, it appears that at this velocity, the wave- 
length of maximum instability is about five times the 
jet circumference. 

Figure 2(e) shows a supersonic jet from a Type B 
0.20 mm nozzle. The jet tip has two head shock waves, 
a phenomenon which has been observed in several photo- 
graphs. The appearance of the two non-stationary shock 
waves may be due to abrupt changes in frontal area as 
the jet sheds. The velocity of the jet near the orifice 
(as measured by free-stream shock waves) is about 460 
m/sec, while the tip velocity is 295 m/sec, the ratio of 
tip velocity to jet velocity being 0.64. 


CAVITATION PRODUCED BY SUPERSONIC JETS 


The jets were studied with regard to their interaction 
with a surface of a liquid to determine their velocity of 
cavitation. In this study, the jets were shot vertically 
downward into a glass-walled container 8 cm by 5 cm, 
and 10 cm high, filled with water. The position of this 
container is shown by the dotted lines in Fig. 1(b). 
The distance from the jet nozzle to the water surface 
was 8 cm. Two successive shadowgraphs with an in- 
terval of 78 microseconds between them are shown in 
Fig. 3. The jet velocity at a distance of approximately 
5 cm from the water surface can be determined from 
the free-stream shock wave angle; and the speed of 
cavitation by measuring the cavity-tip displacement, 
since the time interval between the shadowgraphs is 
known. 

The vertical lines in Fig. 3 are produced by sugar 
crystals dropped into the water just before the shadow- 
graph was taken, on a line parallel to the photographic 
plate, and passing through the jet, in order to observe 
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the displacement of the water due to the cavitation. A 
small shock wave in the water is visible in the right- 
hand photograph of Fig. 3. The lines drawn on each 
side of Fig. 3 indicate how the cavity is displaced from 
its true position by refraction. The scale shown at the 
bottom of Fig. 3 is nearly the same for both the top and 
bottom parts of the photograph, however. 

The velocity of cavitation U is plotted against the 
jet velocity V, for several supersonic jet velocities in 


DUNNE AND B. 


CASSEN 


Fig. 4. These values were determined from cavity 
depths varying between 3 and 5 cm, where a steady 
state cavitation rate might be considered to prevail. 
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The relativistic differential equations of motion of an electron in the field of a linear accelerator are con- 
sidered. Treating only electrons near the axis, the field components are expanded in powers of distance from 
the axis r, and terms in r and ¢ of order two higher than the lowest order terms are neglected. The first in- 
tegral of two of the equations of motion are then obtained, and together with the equation of moticn for the r 
coordinate, give a second-order nonlinear differential equation for r as a function of the phase of the electron 
with respect to the peak of the accelerating electric field. Utilizing earlier results giving the dependence of 
phase on distance along the axis of the accelerator, a choice of a particular dc magnetic focusing field allows 
this differential equation to be integrated twice, from which ¢ as a function of distance along the accelerator 
is plotted. 

The need for a focusing field is indicated, and it is shown that the particular field chosen has the advantage 
that with it, the electron will oscillate between known fixed limits of r. The necessity of shielding the electron 
emitter from the focusing field is shown, and an indication of the magnitude of the field and length of the 





accelerator for which it should be used, is given for the Purdue accelerator. 


INTRODUCTION 


OMPLETE electron orbits for the first two sections 
of the Purdue linear electron accelerator have been 
worked out by Caplan and Akeley' for electrons in- 
jected along the axis of the accelerator. For the axial 
electron, there is only a longitudinal force, and the 
problem is one-dimensional. In practice, however, all 
of the electrons will not be injected exactly along the 
axis, and the electron will experience transverse de- 
focusing forces. To counteract these defocusing forces, 
a dc focusing magnetic field is used over part of the 
length of the accelerator. 
It is the purpose of this work to investigate the trans- 
verse motion of the electron under the action of the 
combined electromagnetic and focusing fields.” 


INTEGRATION OF THE EQUATIONS OF MOTION 


We will consider an axially symmetric TM wave 
upon which is superimposed an axially symmetric dc 


* Published as Linear Electron Accelerator Technical Report 
No. 1, Department of Physics, Purdue University, June, 1952. 

1 DPD. Caplan and E. Akeley, J. Appl. Phys. 24, 774 (1952). 

2E. L. Chu, The Theory of Linear Electron Accelerators, 
M. I. Report No. 140 (Microwave Laboratory, Stanford Univer- 
sity, May, 1951). Chapter six deals with the subject of this paper 
in a different manner. 


magnetic field. The total field components are then 
given by 
E.= EvJ o(k-r) cosy,’ 


E,=0, 

E,= (k,v)EoJ 1(R,r) siny, 
A.=H,(r,2), 

Ho=k,"EoJ (kr) siny, 
H,=H,/(r,z). 


In these equations, distance along the axis and distance 
off the axis in units of \/2r have been denoted respec- 
tively by z and r. The phase is expressed as y=z/v—1, 
where the time ¢ is in units of 1/w, and the phase ve- 
locity of the wave 2, is in units of c. For convenience in 
the discussion, the electronic charge e will be taken as 
positive, and the direction of the electric field will be 
defined as the direction of the force on the electron. 
k,= (1—v~)!, and is imaginary for »=0.7 in the first 
section of the Purdue accelerator. The only dimensional 
quantity in the above equations is the amplitude Ep 
of the axial electric field, which for the first section 


8J. C. Slater, Microwave Electronics (D. Van Nostrand Com- 
pany, Inc., New York, 1950), p. 276. 
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of the Purdue accelerator is 110 statvolts/cm. The dc 
focusing field will as yet be kept general. 

The relativistic differential equations of motion in 
cylindrical coordinates are then found to be 


d  € Eo ae 
—(mr)— mr? —— ~J(ke)(-—2) siny 
d cw k, v 


e 
——r6H,=0, (la) 
CW 
d ' é€ 
—(mr?6)——r(zH,—7H,)=0 (1b) 


dt cw 


d e({?r 
(mi)——| Boh (be) sin)—réH, 


dt cw 


7 


ekg 
——Jo(k,r) cosp=0. (1c) 


CC®) 


If H. is constant, then H,=0, and the integration of 
(1b) then yields 


mré= —er?H./2wo+C. (2) 


If we consider a more general H, and H., and expand 
these fields in powers of r, Maxwell’s equations, to- 
gether with the neglect of second and higher order 
terms, leads directly to the same result (2). Thus, if 
the electron is sufficiently close to the axis, the result (2) 
can be used even if H. is not constant, and 6 may be 
eliminated from Eqs. (la) and (1c), giving 


d G r seH.\* 
dp Kae 


dt mr? 4m\X we 


é Eo 1 
-=— s(n (--2) siny=0, (3a) 





we k, v 
d e Eo er C eH: 
—(mz)=— —7J,(k-r) siny——H,( —— ) 
dt we k, mcw rr? we 
ey 
——Jo(k-r) cosy=0. (3b) 
C@) 


It is to be noted in Eq. (3a) that the C?/mr* term 
corresponds to a defocusing force which increases as the 
particle approaches the axis. It would then be desirable 
to make C=0. If the subscript c refers to conditions at 
the cathode of the electron gun, from (2) 


C= (m.6.+eH., -/2wc)r-*. 


Since 6, is a statistical quantity, it cannot be hoped to 
have C=0 for all particles. The best that can be done 
is to make H.,.=0, i.e., to shield the cathode from the 
magnetic field. The effect of not shielding the cathode 
will be shown more clearly by the graphs of the motion. 


Neglecting second and higher order terms in r and 7, 
Eq. (3b) reduces to the equation of motion of an 
axial particle, the first integral of which is readily 
obtained as 


(1+ 9.*)!/v— p.=A (siny+), (4) 


where p.=2/(1—2*)!, A=eEo/mocw, and k is an inte- 
gration constant to be determined by the initial condi- 
tions. Thus the axial longitudinal solution may be 
assumed to be unchanged for particles with r and 7 
sufficiently small so that second-order terms may be 
neglected in comparison to the other terms in Eq. (3b). 

Using Eq. (4) and neglecting third-order terms, 
Eq. (3a) may then be written in the form 


d 
ae = C?/mr— (moA?/4)r(1—2)! 
X { (H./Eo)?—2 siny(siny+k)}. (5) 


[Since 2 is implicitly given as a function of y by Eq. (4), 
the time derivatives may be changed to derivatives with 
respect to y, and the variables 2 and / may be eliminated 
from Eq. (5). This results in a nonlinear second-order 
differential equation. If C=0, the equation is linear and 
may be put in the form of the general Lamé equation, 
and a series solution may be obtained. We, however, 
wish to consider the effect of the nonlinear term, and 
we also desire solutions which can be used for computa- 
tion other than in the immediate vicinity of the point 
about which the expansion is made. ] 

As was mentioned before, the term C?/mr' which is due 
to the initial angular momentum of field plus particle, 
corresponds to a defocusing force. This force becomes 
very large as the particle approaches the axis, so that 
if C=O, the electron will never return to the axis. 
This defocusing force becomes less appreciable, however, 
as the energy of the particle increases, because of the 
inverse dependence on m. Equation (5) also shows that 
the sense of the H, field is immaterial. 

In making rough calculations for the magnitude of 
the focusing field, Chu? assumes C to be zero, and takes 
as the focusing condition that d/di(m7r) <0. (This is a 
stronger condition than just taking 7 to be decreasing, 
since m itself increases in the motion.) This calculation 
may be easily made from Eq. (5), if again C is taken to be 
zero. Then for d/dt(m7) Z 0, (H./Eo)?S 2 sin) (siny+&). 
From Caplan’s and Akeley’s! results, 2(sin) (siny+) 
is plotted against z (Fig. 1) for particles having initial 
phase in the first quadrant. It is apparent from Eq. (5) 
that a positive value of 2 siny(sinY+) corresponds 
to a defocusing force. From Fig. 1, it is seen that for 
particles having an initial phase in the range between 60° 
and 90°, 2 siny (sin)+) is always positive; and for 
particles with initial phase between 0° and 60°, 2 siny 
X (siny+&) is positive at the beginning of the acceler- 
ator. Therefore, at the beginning of the accelerator 
where the focusing problem is most acute (because of a 
relatively small z); for the initial phases that give the 
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Fic. 1. 2 siny(siny+) plotted as a function of z for initial 
phases in the first quadrant, as calculated from the results of 
Caplan and Akeley (reference 1). 


largest energies, the radial force is defocusing. This is 
the reason for the use of a focusing magnetic field. 

From Fig. 1, it is seen that the largest value of 
2 siny(sinY+) occurs at the injection of the particle 
with initial phase 90° and is equal to 20.47. In the first 
section of the Purdue accelerator, y= 110 statvolts/cm. 
Then for H,S498 gauss, focusing action will occur 
throughout the entire section. 

While a general solution of Eq. (5) will not be at- 
tempted, a solution valid throughout the entire first 
section, and showing the effect of an initial angular 
momentum (C0), will be obtained under a somewhat 
special assumption. 

Let e= (H./E)*—2 siny(sin)+), and then Eq. (5) 
may be rewritten as 


d ‘ 
Tad ia ial (moA*/4)er(1— 2*)!. (6) 
t 


A general solution of Eq. (6) can and will be obtained 
under the special assumption that ¢ is constant. This 
will be approximately true if H, is constant and suffi- 
ciently large so as to make the variation in 2 siny 
X (sin’+k) negligible in comparison to (H,/ Eo)?. How- 
ever, that would necessitate a constant field of several 
thousand gauss which is unnecessary. Referring again 
to Fig. 1, it is to be noted that the curves for initial 
phase between 20° and 90° are similar in appearance. 
Thus if an H, field were set up so as to make ¢ constant 
for one particle, then it would be essentially constant 
for all particles with initial phase between 20° and 90°. 
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The desirability of having ¢ constant is not only for 
mathematical convenience, but as the solutions will 
show, if € is constant, then r will oscillate between 
fixed limits which can be calculated from the first 
integral. The field considered is shown in Fig. 2. 

Taking ¢€ constant, the integration of Eq. (6) gives 


P= +{ (1—2) (x—€Av?/4—C?/me?r’) } 3, (7) 


where « is a constant of integration. While a general 
solution will be obtained, a knowledge of the motion 
may be obtained directly from the first integral (7). 

Consider first C=0. Since ¢ is a positive constant, it 
follows from Eq. (6) that * is everywhere decreasing 
(except when the particle crosses the axis, in which 
case 7 changes sign.) Then, if at injection *;>0, r will 
increase until * becomes zero when r= (2/A)(«/e)!. 
Then r will begin to decrease, and will reach the axis 
with *= —{x(1—2*)}!. As the particle crosses the axis, 
* changes to +{x(1—2*)}! and again decreases to zero, 
so that the particle will oscillate about the axis with 
constant amplitude equal to (2/A) (x/e)?. 

If C=O, there will be two different values of r at 
which 7=0, given by 


r= 2x/€A*+ (2/€A*) (x?— €A2C?/m,?)!. (8) 
From Eq. (6), the radial force acting on the electron is 
(1—2°)3(C?/mor® — ermyA?/4), (9) 


and it is to be remembered that a positive force is a 
defocusing force. Neglecting 6., C=er2H.,-/2wc where 
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Fic. 2. Axial magnetic field as a function of distance along axis. 
The field is taken to be 750 gauss at the beginning of the accelera- 
tor, and the variation is chosen to make ¢ constant for the particle 
with initial phase 60°. (The variation in the values of ¢ for the 
particles with initial phase 20° and 90° are then less than 10 
percent.) 
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again the subscript c refers to values at the cathode of 
the electron gun. Then at the time of injection 


C2 erm A? 2H, 27; (- ) 
( ) a Amy? r; 
Eo . Hi; s 
— (—) | ( ) —2 sina (sind +) , 
Bae Eo 


Consider now the case where the cathode is not only 
unshielded, but in addition the field at the cathode 
is equal to the field at the beginning of the accelerator. 
Assume also that r-=7;, which will be true for particles 
travelling parallel to the axis of the electron gun. Then 


( C? one) Er e* siny;(siny;+ k) 


mor? 4 











mor® 4 





(10) 














(11) 
2myw*c? 
It is noted that under these conditions, the radial force 
on the electron at the beginning of the accelerator is a 
defocusing force. What is even more important, is 
that this radial force is independent of H,, which most 
certainly defeats the purpose of the magnetic field. 
From Eq. (10) it is clear that for a given H,,;, if H.,.=0 
(i.e. if the cathode is shielded), then the focusing force 
at the beginning of the accelerator will be a maximum. 
While further deductions may be made from Eq. (6) 
and its first integral (7), the general solution will show 
much more clearly these deductions. From Eqs. (7) 
and (4), it follows that 


+ (x—€A?r?/4—C?/m¢?r?)“hdr 
=+A-'{ (siny+k)?—g*}-tdy, (12) 


where g?= A~*(v-?—1). In the expression on the right, 

the minus sign is used for z<v (before the phase 

minimum), and the positive sign for z2>v. The integral 

of this term is an elliptic integral of the first kind as 

may be seen by making the substitution y=siny. 
Integration of Eq. (12) then gives 


2k’ e\} 
r= —— sn (<) {sn (Si,S2)— sn (S1,S2)} 


eA? g 
xk—€A’r?/2 2k 
+sin\(—) Ly (13) 
x’ eA? 


and in the special case when C=0, 


2 /«\! 1se\! 
-—(*) sn| —-(“) {sn—(S1;,S2)— sn (S1,S2)} 
AXe 2\g 
int ~“(; | 14) 
san 0) If 
[(ao) aan) 
UN e—eg/ Ny tet] 1 
1 
2 


k?— (g—1)?4} 
ies = 
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where 
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Fic. 3. 7,2 diagram for shielded cathode. The initial phase is 
labelled on each orbit. r;=0.296 (=0.5 cm) for each orbit. Orbits 
labeled I, II, and III have #,;=0.12;, 0, —0.12;, respectively. 


and 
x’ = (x?— €A2C?/m,?)!. 


Both of these solutions apply only before the phase 
singularity and before the first maximum in r. The 
modifications of the solution beyond these points are 
obtained by integrating in steps, taking into account 
sign changes in the functions. 


CONCLUSIONS 


Solutions (13) and (14) give r as a function of y. 
The dependence on z is then obtained from Caplan 
and Akeley’s' results of phase as a function of distance. 
r is then plotted against z for the shielded cathode in 
Fig. 3, and for the unshielded cathode in Fig. 4. (For 
the unshielded cathode, C was evaluated assuming 
that the focusing magnetic field is constant in the 
electron gun, and that the electrons travel parallel to 
the axis of the gun.) For the Purdue accelerator, the 
electrons are injected with an initial energy of 50 kev, 
and it is assumed that no electron enters the accelerator 
more than 0.5 cm off the axis. 

The solutions for the unshielded cathode (Fig. 4) 
indicate that at injection r is a minimum and defocusing 
will ensue. However, the actual solutions represented 
in Fig. 4 cannot be considered valid for the large values 
of r where the approximations made would not be 
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Fic. 4. r,z diagram for unshielded cathode. The magnetic field 
at the cathode is 750 gauss. r;= 0.296, #;=0. The initial phase is 
labeled on each orbit. 
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valid. However, the fact that r does become large is 
enough to make the situation undesirable and to be 
avoided. If the magnetic field at the cathode were less 
than that at the beginning of the accelerator, then the 
defocusing would not be as pronounced. However, as 
was indicated previously, for maximum focusing, the 
magnetic field at the cathode should be zero. 

As the energy of the particle increases, the radial 
forces become small. For this reason it is unnecessary 
to have the focusing field over the entire first section. 
In Fig. 2, if the focusing field, instead of having a 
minimum at z=3, were to continue dropping from 
z=2 (=3.37 cm) as indicated by the broken line, then 
the focusing force for z= 2 would be less than that giv- 
ing the orbits of Fig. 3. However, at z=2 all the elec- 
trons with 7;=0.296 (=0.5 cm) will be approaching 
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the axis, so that a decrease in the focusing force will 
mean a decrease in the slope of the curves, and conse- 
quently a larger value of z at which the electron will 
cross the axis. Preliminary calculations indicate that 
the values of r at the end of the first section of the 
accelerator would then be less than those indicated by 
Fig. 3. Thus it would appear that cutting off the field 
not‘only gives a saving in power, but will also improve 
the focusing. 

If the maximum value of the field employed is less 
than 750 gauss (but still greater than 498 gauss), 
then the field should be used over a greater length of 
the section. However, since it was necessary to fit the 
magnetic field to make e= (H,/E»)?—2 siny(siny+) 
constant, smaller fields would introduce larger errors 
and make the results of this paper less applicable. 
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The friction coefficients of freshly cut surfaces in inert atmosphere are compared with values on the same 
surfaces later exposed to air. The kind of metal or oxide comprising the friction couple was also varied. From 
experimenta] data it was concluded that: (1) Solid phase welding accounts for almost the entire magnitude 
of the static friction coefficient for metal-metal couples. (2) Fe-Ag does not solid phase weld, other insoluble 
couples examined do weld. (3) A measure of solid phase weldability appears to be the quotient of the work 
of adhesion and the shear strength of the weaker couple. (4) Titanium does not solid phase weld to crystalline 
oxides to any greater extent than copper. (5) Freshly cut titanium solid phase welds to other metals in fric- 
tional contact. The degree of welding is independent of the atmosphere (air, nitrogen, or argon) or the time 
(less than 24 hours) at room temperature in such atmospheres. This behavior is contrary to that experi enced 
with iron or copper. With the latter couples less than five minutes exposure of a freshly cut surface to air is 
sufficient to markedly reduce the degree of solid phase welding. (6) The seizability of titanium is probably 
due to the lack of resistance, of the contaminant film formed on it in air, to penetration or wear in frictional 





contact. 


INTRODUCTION 


HIS project has as its general objective a deter- 

mination of the factors that affect seizure and 
galling. As a first step toward the fulfillment of this 
objective, it was decided to study the solid phase 
weldability of different metals to determine whether 
there are exceptions to the rule that all clean metals 
weld to each other. The literature reveals many con- 
flicting statements making a conclusion on this question 
worthwhile. Specifically, Bowden and Tabor imply that 
all clean metals solid phase weld to each other.' Mer- 
chant? disputes this view, claiming that metals which 
are mutually insoluble do not weld to each other. Feng’ 
believes that solid phase welding is unimportant in 


1F. P. Bowden and D. Tabor, “Wear and Damage of Metal 
Surfaces,” Mechanical Wear, Editor, J. T. Burwell, Jr. (American 
Society for Metals, New York, 1950), p. 117. 

2M. E. Merchant, Discussion to reference 1, p. 135. 

#1. Ming-Feng, J. Appl. Phys. 23, 1011 (1952). 


friction, the major contribution to the friction force 
arising from mechanical interlocking of the two bodies 
in contact. 

The experiments to be described have been designed 
to provide a conclusive answer to this controversy. 
Also, because of the current interest in titanium, it has 
been included in the list of metals to be tested. 


EXPERIMENTAL TECHNIQUES 
Friction Apparatus 


A modified Bowden-Leben apparatus for measuring 
friction coefficients has been constructed and is illus- 
trated in Figs. 1(a) and (b). The shaper A is used to cut a 
clean surface on hard slider specimen B attached to 
piston C of hydraulic cylinder D. The box E maintains 
an inert atmosphere supplied from a tank of prepurified 
nitrogen or argon through a titanium chip furnace 


train to remove oxygen. After the shaper cuts a clean 
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surface on the slider, the piston C is moved forward 
under hydraulic pressure at a rate controlled by bleeding 
oil through a needle valve. The hydraulic pressure is 
supplied by a tank of nitrogen acting through the me- 
dium of oil contained in a reservoir. When the slider 
is properly under the hemispherical rider F, the motion 
of the slider is stopped. The hemispherical rider F is 
then lowered by means of screw G into contact with 
the slider. The load is applied by deflection of member 
H and measured by means of strain gauges and the 
Sanborn recording bridge 7. When the desired load is 
reached clamp J is tightened and the slider again set 
into motion by means of the hydraulic action previously 
described. The frictional force is then measured by 
means of the deflexion of flexure member K, attached 
strain gauges and another channel on the Sanborn 
recorder. 


Friction Tests—Materials and Methods 


The materials used in these tests are high-purity 
elements, with the exception of the copper and titanium 
used for the sliders. These elements and their nominal 


‘ purities, as received, are listed in Table I. Some of these 


elements were in powder or shot form and had to be 
made into usable shape. Melting and solidification into 
the desired shape was accomplished in cleaned, evacu- 
ated Pyrex tubes. Fine grained specimens were obtained 
from this treatment. The Brinell hardnesses of the 
specimens, as used, are also listed in Table I. 

The four hardest materials, Ti, Fe, Ag, and Cu are 
used as flat slider materials and are cut in place by the 
shaper to provide clean surfaces. The other materials 
and also Ag and high-purity annealed Cu are used to 
make the softer hemispherical rider. The riders because 





Fic. 1(a). Friction measuring apparatus (close up view). 
A. Shaper. B. Slider. C. Piston. D. Hydraulic cylinder. Z. Atmos- 
phere box. F. Hemispherical slider. G. Load adjustment screw. 
H. Load flexible member. J. Load clamp. 





Fic. 1(b). Friction apparatus. A. Shaper. D. Hydraulic cyl- 
linder. EZ. Atmosphere box. G. Load adjustment screw. 7. Sanborn 
recorder. J. Load clamp. K. Friction flexible member (enclosed 
within rubber gasket). 


of their relative softness are self-cleaning. The harder 
slider acts to cut or wear away the surface of the mating 
softer rider during a friction test. It is important to note 
however, that the rider is never run twice over the same 
path on the slider. Thus, if no material transfer takes 
place from the hard slider to the soft rider the type of 
bond is always a rider-slider bond and never a rider- 
rider or slider-slider bond. 


Friction Tests—Procedure 


1. A clean surface was cut on the slider in air. 

2. At least 15 minutes elapsed before making friction 
runs on this surface. 

3. At least 3 friction runs were made in air. 

4. The box surrounding the friction equipment was 
then partially evacuated and either prepurified, de- 
oxidized nitrogen or argon was allowed to flush through 
the box for at least 20 minutes. 











TABLE I. 
Brinell 
Metal Purity hardness 

Lead 99.999 3.2 
Copper rider 

annealed) 99.999 9 
Zinc 99.999 12 
Magnesium 99.999 17 
Cadmium 99.999 21 
Aluminum 99.99 27 
Tron 99.999 57 
Copper rider 

ce PM 99.999 57 
Copper slider 

(Electrolytic Tough Pitch) 99.9+ 62 
Silver 99.9 70 
Titanium 99.+ 189 
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TABLE II. Static coefficients of friction. 








Friction components Data obtained in air 


Data obtained in air on 


‘ —* 
Data obtained in N: or argon surfaces produced in 





Rider on slider i* run Succeeding runs Slider clean Rider +slider clean nitrogén or argon 
Al on Ti (3)* 0.52-0.65 0.67 0.68 0.69-0.87 0.65 

Al on Fe (3) 0.41 0.44 0.59 0.60 0.45 

Al on Cu (1) tee ee 0.54 0.90 0.45 

Ag on Ti (3) 0.48+0.1 0.51 0.60 0.66 0.54+0.1 
Ag on Fe (2) 0.41+0.1 0.44 0.46 S 0.48 0.47 

Ag on Cu (1) ee ee 0.29» = 0.75 0.29 

Cd on Ti (2) 0.64 0.69 0.68 = 0.75 0.71 

Cd on Fe (3) 0.57 0.55+0.1 0.57+0.1 S 0.71 0.59 

Cd on Cu (1) eee cee » 0.74 & 0.86 0.73 

Cu on Ti (2) 0.62> 0.62» & 0.66 & 0.65 0.62 
Cu on Fe (3) 0.51+0.1 0.57+0.1 £ 0.73 pe 0.72 = 0.64 

Cu on Cu (2) } 0.45+0.1 0.59 3 0.67+0.1 cS) 1.65+0.5 $ 1.30-0.7°¢ 
Mg on Ti (3) c 0.71 0.67 - 0.67 2 0.68 ==« v 0.66 
MgonFe (2) ‘S 02 0.47 = 0.44 = 0.45 2 0.44 

Mg on Cu (3) 5 ee ee $ 0.35 z 0.40 = 0.47 

Pb on Ti (4) pe 0.63+0.1 0.65+0.1 = 0.68 = 0.70 = 0.67 

Pb on Fe (4) 5 0.76-1.44 0.59-0.96 3 065-094 8 077-095 2  051-1.16 
Pb on Cu (2) na 0.42» 0.55» ns 0.55» a 1.28» a= 0.85> 

Zn on Ti (2) = 0.58 0.65 = 0.67 = 0.73 ° 0.67 

Zn on Fe (2) a 0.54 0.57 a 0.65 7 0.66 2 0.62 

Zn on Cu (1) eee tee 0.69 0.90 0.54 

Ti on Ti (1) 0.69 0.65 0.68 0.67 0.68 

Fe on Fe (3) 0.17-0.51 0.51 0.62 0.69 0.53 

Ag on Ag (2) ove eee 1.76 2.44 1.16 








The general uncertainty in each value is +0.05 except where noted differently. 


* Number of separate runs to determine average values shown in table. 
>Stick-slip absent. : , s 
¢ Value decreases with increased holding time in atmosphere. 


5. A friction run was then made on the same surface 
now in an inert atmosphere. If the friction value was 
the same as obtained in air then a new surface was 
cut on the slider, the cut being made in the inert 
atmosphere. (In all cases no change in the friction co- 
efficient using the oxidized slider was found as a result 
of changing the atmosphere.) 

6. At least 1 friction run was made on this freshly 
cut surface. 

7. The rider was then rubbed on a clean fine file at 
high load and speed to clean the rider and at least 2 
friction runs were carried out for this condition. 

8. The box was opened up to air, 5-15 minutes 
allowed to elapse and another friction run made to 
check the results obtained in item 3. 

9. The friction tracks were then examined using a 
microscope. 


Friction Tests—Data 


A summary of the friction data obtained with the 
above described procedure is presented in Table II for 
loads averaging between 30 and 120 grams and relative 
sliding speeds varying between 0.5-1 cm per minute. 
Variations of speed and load within this range were 


found to have little significant effect on the friction 


data. Stick-slip was found in most of the couples 
studied, except as noted. The friction values in the 
table correspond to the maximum values obtained 
during the stick portion of the stick-slip cycle. 

Results have also been obtained at relatively high 
loads with much the same results, except implications 





of surface melting for the cases of Pb, Cd, and Zn riders 
on Ti. The results for the lower loads are probably 


' representative of the solid surfaces. 


The effect of the hardness of the rider for a given 
rider material was also investigated for Cu on Ti. The 
cold worked copper having a hardness of 57 Brinell 
was annealed by heating to 550°C for 25 minutes to a 
hardness of 9 Brinell. The friction data for this series 
of experiments is presented in Table III. 

The reproducibility of the friction data was adequate. 
All data were within +0.05 of the values shown in 
Table II except where noted. 

An aid to the interpretation of Table II is presented 
in Table IV where the increment in the static friction 
coefficient on provision of a freshly cut slider surface* 
(4:1) and of a freshly cut rider surface (A2) are shown. 

The results presented in Tables II-IV may be de- 
scribed as follows: 


1. An increase in the friction coefficient on provision 
of a “clean” Fe or Cu surface in an inert atmosphere 
was observed for the rider metals, Al, Cu, and Zn. 

2. A similar increase on provision of a “clean” Ti 
surface was observed only for the Ag rider. 

3. An increase in the friction coefficient on provision 
of a “clean” rider surface on a “clean” slider surface 
was observed for the case of Al, Ag, Cd, and Zn riders 
on Ti and Cu sliders and for the case of a Cd rider on 
an Fe slider. 


* The provision of a freshly cut slider surface and a freshly cut 
rider surface, in an inert atmosphere, shall be referred to as clean- 
ing of the slider and rider metals, respectively. 
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TABLE III. Annealed versus unannealed Cu on Ti. 








Coefficient of static friction 


Average difference between static and kinetic coefficients 





Cut in air Cut in inert atmosphere Cut in air Cut in inert atmosphere 

Runs after Runs after 

ist run Succeeding Run in rubbing 1*%t run Succeeding Run in rubbing 
in air runs LA. rider in air runs LA. rider 
Unannealed 0.50 0.56 0.51 0.63 0.07 0.00 0.06 0.04 
0.50 0.55 0.69 0.00 0.00 0.06 
(57 Brinell) 0.57 0.53 0.50 0.50 0.19 0.03 0.00 0.02 
0.51 0.49 0.47 0.60 0.00 0.00 0.02 0.04 
Annealed 0.46 0.55 0.58 0.66 0.06 0.05 0.05 0.05 
0.49 0.48 0.46 0.54 0.03 0.03 0.02 0.05 
(9 Brinell) 0.57 0.52 0.52 0.57 0.03 0.02 0.04 0.03 

Load —1 kg 


Speed—0.5-1 cm per minute 








4. With both rider and slider “‘clean” the friction 
coefficients for a given rider metal were about the 
same on both Ti and Fe sliders for Cd, Cu, Pb, and Zn. 
(Al probably falls into this category as well.) 

5. Only for the case of Ag and Mg riders were the 
values for the “clean” state higher on Ti than on Fe. 

6. Ti run on itself showed a constant value of the 
friction coefficient whether “clean” or “unclean.” 

7. Fe or Cu run on itself showed an appreciably 
higher friction coefficient in the “clean” state as com- 
pared to the “unclean”’ state. 

8. Values of the friction coefficient for the Cu slider 
in the clean state are higher than for the same rider 
metals on Ti and Fe sliders. 


GUIDE TO THE INTERPRETATION OF 
FRICTION COEFFICIENTS 


Bowden has shown that if welding is assumed to occur 
between the friction components then the static fric- 
tion coefficient is related to the shear strength and the 
flow pressure (hardness) by the equation 


ANALYSIS OF RESULTS 


Correlation between Friction Data and 
Work of Adhesion 


Some of the questions raised in the introduction can 
be conclusively answered on the basis of the results 
obtained in this investigation. There is no doubt that 
solid phase welding occurs in frictional contact. The 
mere contact of a clean surface with air at room tem- 
perature markedly reduces the friction coefficient in 
many cases. The amount of such reduction is a function 
of the time the surface is bathed by the air. This reduc- 
tion is accomplished without any observable change in 
surface geometry. Increasing the temperature at which 
the surface is held in the air without any visible oxide 
film being formed and without any observable change 
in surface geometry reduces the friction coefficient 
from 0.7 to about 0.2 (see Table V). Thus, the assump- 
tion of Feng is found to be false because mechanical 
interlocking cannot explain the above results and thus 
can play only a minor role in friction phenomena. The 





friction couple used in this study, a soft hemispherical 

S : 
u=a-, (1) rider on a hard flat slider, should magnify this effect if 
H present. The conclusion is therefore applicable to other 


where y static friction coefficient, S shear strength of 
weaker component, H flow pressure of softer compo- 
nent, a fraction of contact area which has welded. 
According to Eq. (1) for the same softer and weaker 
rider metal sliding over different harder metals, the 
coefficient of friction should be independent of the slider 
metal provided that a, the fraction of the contact 
area that is clean (welded) does not change. For what 
has been called the clean state in this paper it has been 
found that the friction coefficient is the same for Fe and 


types of friction couples. 

Also, the assertion of Merchant, that solid phase 
welding will not occur between metals that are mutually 
insoluble, has been found inaccurate. Cd which is in- 
soluble in Fe, welds to Fe as well as any other metal 
tested (for example, Zn and Cu, both of which are 


TABLE IV. Effect of cleaning surfaces in nitrogen on 
static friction coefficients. 














Ti sliders, but is generally higher for Cu sliders, with 
given rider metals. This result can be interpreted as 
indicating that @ in the clean state for Fe and Ti is the 
same, but is larger for Cu. This interpretation is con- 
sistent with the fact that after thorough cleaning in 
high-vacuum friction coefficients can approach the 
magnitude of 100. Thus, variation in cleanliness alters 
the factor a, as indicated by the finite values of A; and 
A: in Table IV. 


Ti Fe Cu 

A A» Ai A: Ai As 
Al 0.01 0.20 0.15 0.01 0.09 0.36 
Ag 0.09 0.06 0.02 0.02 0.00 0.46 
Cd 0 0.05 0 0.14 0.01 0.12 
Cu 0.04 0 0.16 0 ore 1.00 
Mg 0 0 0 0 0 0 
Pb 0.02 0.02 vee cee 0.00 0.73 
Zn 0.02 0.06 0.08 0.01 0.15 0.21 








® Increase in static friction coefficient on cleaning slider in nitrogen. _ 
b Increase in static friction coefficient on cleaning rider and slider in 
nitrogen. 
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TaBLe V. Effect of heating Ti slider in air on 
static friction coefficient. 

















Friction Unheated Heated* 
components (in air) (in air) 
Cu on Ti 0.65 0.23 
Mg on Ti 0.68 0.23 
* Ti surface heated in air at 350°C for 17 hours, rider surface was cleaned 


by rubbing prior to test. No change in coefficients observed on performing 
test in air or in argon. 


soluble in Fe.) Some insoluble metals however, do not 
weld to each other. In particular, the rider metal Ag 
does not weld to Fe. The latter result also contradicts 
Bowden and Tabor’s implication that all metals in 
frictional contact solid phase weld to each other. 

In the terminology used in this investigation solid- 
phase welding takes place if separation of the two fric- 
tion components does not occur at the interface but 
rather within the weaker component. Thus, to predict 
whether a given friction couple will solid phase weld or 
not it is necessary to know the relative strengths of the 
weaker component and the interphase interface, under 
the conditions of stress imposed on the friction couple. 
These stress conditions are not well defined and hence 
it is difficult to state precisely the condition of antisolid 
phase welding. From the following argument those 
friction pairs which will certainly solid phase weld can 
be predicted. 

It is known that estimates of strength using the work 
necessary to form two surfaces in place of an interface,‘ 
the work of cohesion, are larger than observed strengths 
by a factor of at least 100. Thus, it can be assumed 
that if the work required to separate the friction couple 
at the interphase interface is as large as the ideal work 
required to separate the weaker component at a grain 
boundary, then separation will certainly occur within 
the weaker component. This assertion is based on the 
assumption that there are no imperfections at an inter- 
phase interface, which can lead to a lower strength than 
that associated with the imperfections in a grain bound- 
ary, or the interior of the weaker component. The work 
of separation at the interphase interface is the work of 
adhesion in surface chemistry nomenclature, i.e., 


W as= YatVs— Yas; 


where y; surface free energy of phase 7 per unit area, 
Yas interphase interface free energy per unit area. Thus, 
if Was>Waa, solid phase welding will occur. (In this 
case, a is assumed to be the weaker phase.) Bondi has 
collected values of Wag and Waa.*® These values are 
shown in Table VI together with the conclusions drawn 
from friction measurements concerning solid phase 
weldability. It is apparent that solid phase welding 
occurs for the couples which have work of adhesion 
values on the order of work of cohesion values. Also, 


*F. Seitz, Physics of Metals (McGraw-Hill Book Company, 
Inc., New York, 1943), p. 167. 
5 A. Bondi, Chem. Revs. 52, 417 (1953). 
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it is apparent that couples, which have work of adhesion 
values less than about 3 the corresponding work of 
cohesion, solid phase weld to a lesser extent, if at all. 
In this respect it is interesting to note that metal-oxide 
interfaces, which are normally considered to be re- 
sistant to solid phase welding, have corresponding work 
of adhesion values on the order of 500 ergs/cm? or less. 
This value is about the same magnitude found for the 
Ag-Fe couple. The couples which weld have work of 
adhesion values exceeding about 1500 ergs/cm?. This 
correlation is not strictly rigorous because the strength 
properties of the weaker component will govern the 
critical value below which welding will not occur and 
above which welding will occur for the metal in ques- 
tion. Perhaps a more representative criterion would be 
the quotient of the work of adhesion and the strength 
of the weaker component. 


Seizability of Titanium 


According to the results shown in Tables II-IV the 
cleanliness of the Ti surface is not changed by cutting 
in air, cutting in argon or nitrogen, or subsequent ex- 
posure of the surfaces prepared in the latter condition 
to air. Values of A, for the Ti slider are nil except for 
the Ag rider. The Fe surface however, changes its 
cleanliness as shown by the finite values of the quantity 
A; for the Fe slider shown in Table IV. Similarly for 
the Cu surface, the question arises as to whether the 
Ti surface is equally “‘unclean” or equally “clean” in 
the atmospheres. The friction data for the Cd, Cu, Pb, 
and Zn riders in the “clean” condition, which have the 
same values on both the Fe and Ti surfaces imply that 
welding is occurring equally well to the Ti surfaces as 
to the Fe surfaces, even in air. Thus, it appears that the 
Ti surfaces are effectively clean in that whatever layer 
exists on the Ti surface does not prevent solid phase 
welding to the degree that the layer produced on Fe in 
air prevents such welding. In order to check this idea a 
surface was machined on a Ti specimen in air according 
to the same procedure used for the data in Table II. 
This specimen was then heated in air at 350°C for 17 
hours. No distinct oxide layer was observed visually, 
however, this treatment so altered the Ti surface as to 


TABLE VI. Correlation between work of adhesion and 
solid phase weldability. 








Static friction coefficients 





Both Same surfaces Work of 
Friction surfaces as tested adhesion 
components clean in air ergs/cm? 
Ag on Cu 0.75 0.29 2330 
Cu on Fe 0.72 0.50 2675 
Ag on Fe 0.48 0.47 712 
Cu on SiOz 0.10 0.10 325 
Work of cohesion at grain boundaries, ergs/cm? 
Metal 
Ag 1780 
Cu 2260 
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FRICTION OF CLEAN 


reduce the friction coefficients for the Cu-Ti and 
Mg-Ti couples from about 0.7 to about 0.2 as shown in 
Table V. Thus, it can be concluded that a freshly 
machined surface of Ti is clean in the sense that no 
“lubricating” oxide layer exists on it. Such a layer can 
be produced by heating the Ti in air. 

A question still remains to be answered. Although no 
lubricating quality oxide film may exist on the Ti 
surface, such oxides may exist on the rider surfaces, 
in air. However, the coefficients measured in air (after 
the very first run) are the same as those for clean sur- 
faces for many of the couples comprising Ti. The 
question is whether Ti can weld to some oxides as well 
as it can to the clean metal. In order to check this point, 
Ti and Cu riders were run against a number of crystal- 
line metal oxides. The data are presented in Table VII. 
It is apparent that the friction coefficients are the 
same, within the experimental uncertainty, for both 
the Cu and Ti riders, with the possible exception of Ti 
on rutile. From this result, it must be concluded that 
Ti does not weld to oxides to any greater extent than 
other metals, and, if welding occurs, the degree of weld- 
ing is much less than it is for Ti-metal couples. The 
abnormal seizability of titanium must therefore be 
ascribed to the thinness (or absence) of the contaminant 
(oxide?) film formed on it in air, at room temperature. 
This film, if it exists, cannot be resistant to penetration 
or removal under sliding contact. This conclusion is 
reasonable in view of the fact that the solubility of 
oxygen in titanium is extremely large. Thus, the 
formation of an oxide film at room temperature would 
require the oxygen concentration at the surface to 
reach the value at the terminal phase boundary—a 
process which may require considerable time at room 
temperature. 


Summary of Results and Conclusions 


1. Solid phase welding accounts for almost the entire 
magnitude of the static friction coefficient for most 
metal-metal couples. 

2. Some couples, comprised of metals which are 


+ This conclusion may need to be modified for the couple 
Ti-MgO. Table IT shows that for the couples Mg-Fe and Mg-Cu 
the friction coefficients are low and unchanged by cleaning. This 
result is interpreted to mean that the Mg picks up oxygen even 
from the inert atmosphere quickly enough to form an effective 
protective film to Fe and Cu, but because of the high friction 
coefficient on Ti, this film is not protective against Ti. 
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TABLE VII. Values of static friction coefficients 
against crystalline oxides. 








Tested in nitrogen 
Tested in air Rider unclean Rider clean 





Ti on corundum (AlI,03) 0.40-0.50 0.40 0.40-0.60 
Ti on quartz (SiOz) 0.15 0.15 0.15 
Ti on zircon (ZrOz) 0.30 0.30 0.30 
Ti on rutile (TiO2) 0.20 0.25 0.40 
Ti on magnetite (Fe;0,) 0.20 0.25 0.25 
Ti on hematite (Fe20;) 0.158 0.15 0.15* 
Cu on corundum 0.55 0.35 0.45 
Cu on quartz 0.10* 0.15%  0.10* 
Cu on zircon 0.20 0.25 0.25 
Cu on rutile 0.20 0.20 0.20 
Cu on magnetite 0.25 0.25 0.25 
Al on magnetite 0.20 0.20 0.20 
Zn on magnetite 0.20 0.25 0.30 
Cu on hematite 0.20* 0.20* 0.25 








® Stick-slip absent. 


mutually insoluble in the solid state, solid phase weld; 
some do not solid phase weld. 

3. A measure of the solid phase weldability appears 
to be the quotient of the work of adhesion to the strength 
of the weaker component. 

4, Titanium does not solid phase weld to oxides to 
any greater extent than Cu. 

5. Freshly cut titanium solid phase welds to other 
metals in frictional contact. The degree of welding is 
independent of the atmosphere (air, nitrogen, or argon) 
or the time (less than 24 hours) at room temperature in 
such atmospheres. This behavior is contrary to that 
experienced with iron or copper. With the latter couples 
less than five minutes exposure of a freshly cut surface 
to air is sufficient to markedly reduce the degree of 
solid phase welding. 

6. The seizability of titanium is probably due to the 
lack of resistance, of the contaminant film formed on it 
in air, to penetration or wear in frictional contact. 
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The objective of this work is to determine the orders of magnitude of the band width available from a 
cavity antenna. Fcr simplicity of calculation a piece of shorted square wave guide of side a, opening into a 
half-space, is considered, and the resonance frequency and band width (1/Q) are calculated from trans- 
mission-line theory. The analysis depends on an equivalent aperture admittance which is defined for the 


lowest-order wave guide mode. 


This type of antenna has a minimum Q which is equal to 0.424(a/A)~* when (a/A) <0.35. The cavity 
depth required for the minimum Q is of the order of 0.42. The required dielectric constant increases rapidly 
as a/d decreases. The effect of dielectric loss is considered. Some experimental] results which agree well with 


theory are presented. 





INTRODUCTION 


T is well known that the band width of an electrically 
small cavity antenna is severely limited by the aper- 
ture dimensions. However, many theoretical aspects 
of this phenomenon are undeveloped. In the present 
investigation the resonance frequency and band width 
will be calculated for a length of shorted square wave 
guide opening into a half-space. This is a restricted 
class of antennas, but the results are indicative of what 
can be expected from other practical shapes. It will be 
shown that this configuration has a maximum band 
width which increases as the cube of the frequency. 

Most previous work on this subject has used the 
stored energy definition of Q. Chu' used the orthog- 
onality of spherical waves to set up an equivalent 
circuit from which he found relations among size, gain, 
and Q. Counter? considered antennas of the type con- 
sidered here, and attempted to find the stored energy 
by a direct integration procedure. Some of his results 
are similar to those reported here. 

In this paper Q is found from the normal modes. 
Such modes are the possible natural oscillations which 
are more commonly associated with closed systems 
such as networks or resonant cavities, but which never- 
theless are assumed to exist for antennas. Heretofore, 
they have been discussed only for the simplest types of 
structures, including spheres, spheroids, and thin 
wires.?* 

It is assumed that an antenna has an infinite discrete 
set of modes and that their time dependence is of the 
form exp(jpnt), where p, is a complex number, 


Pn=Wnt jén, &,>0. (1) 


* This paper is a portion of the dissertation presented by the 
author as part of the requirements for the degree Doctor of 
Philosophy at The Ohio State University. The dissertation has 
been published as a technical report of the Antenna Laboratory 
(see reference 5). 

1L. J. Chu, J. Appl. Phys. 19, 1163 (1948). 

*'V. A. Counter, Miniature Cavity Antennas, Quarterly Reports 
Nos. 2-6, Microwave Laboratory, Stanford University, 1948-1950. 

*J. A. Stratton, Electromagnetic Theory (McGraw-Hill Book 
Company, Inc., New York, 1941), p. 554. 

4S. A. Schelkunoff, Advanced Antenna Theory (John Wiley and 
Sons, Inc., New York, 1952), Chap. 6. 


Each mode has a Q defined by 
Qn=wn/(2En). (2) 


The complex frequency /, allows us to keep the 
formal solutions to the wave equation, but some new 
interpretations are necessary. For example, it can be 
shown that a normal mode wave has an exponential 
envelope which increases in the direction of propaga- 
tion. The mathematical expression describes the wave 
as increasing without limit, but since the corresponding 
physical situation (transient oscillation) was started at a 
finite time in the past, the wave will be zero outside of 
some sphere surrounding the antenna. Further in- 
teresting effects occur when the cavity is a section of 
uniform wave guide.® 

The customary proofs of orthogonality and com- 
pleteness cannot be applied to the set of normal modes, 
because of the behavior at infinity. Hence it cannot be 
said that any transient oscillation (including the steady- 
state condition) is representable as a linear combination 
of the modes. However, the usual procedure in reso- 
nance problems is to assume that each steady-state 
resonance corresponds to one of the normal modes. We 
will do this with cavity antennas. 


CALCULATIONS FOR A SQUARE WAVE GUIDE 
CAVITY ANTENNA 


Consider a piece of square wave guide, shorted at one 
end and opening into a half-space at the other (Fig. 1). 
This antenna may be treated on a transmission line 
basis, with terminals at the aperture, provided the 
field in the wave guide consists essentially of one wave- 
guide mode, and the aperture admittance is known for 
that mode. 

In the following we assume that these conditions are 
met. Therefore, since there are no sources, the system 
(Fig. 1) oscillates when 


Y,+y, coth(y,/)=0, (3) 


where =antenna (normal) mode index, Y,,= aperture 


5M. H. Cohen, The Normal Modes of Cavity Antennas, Tech- 
nical Report 486-7, Antenna Laboratory, Ohio State University 
Research Foundation, Columbus, Ohio, December, 1952. 


582 











ac 
Y: 





1e 
1e€ 
1€ 
e- 
or 


re 


3) 


re 


ty 








ON THE BAND WIDTH OF CAVITY ANTENNAS 583 


admittance, y,= wave-guide characteristic admittance, 
Yn=wave-guide propagation constant. The wave- 
guide parameters are given by® 


Yn=n/(jpnu) for TE modes, 
n= jpné/Yn for TM modes, (4) 
17= c— a, 


where k,?=p,”ue, x= (2r)Xcut-off wave number of 
wave-guide mode. 

Equation (3) may also be obtained by a variational 
calculation for the eigenvalues p,, under the assump- 
tion that the distribution of tangential electric field in 
the aperture is a single wave-guide mode function.° 
A similar variational calculation’ for the aperture ad- 
mittance for a square wave guide, under the assump- 
tion that the tangential aperture field is the TE, 
mode distribution, yields the following for the aperture 
conductance G and susceptance B: 


G=0.009012?—0.01522x*+-0.01436x° 
—0.00848x5+- - - - 


5 
B= —0.000809x'+-0.00537x—0.01278 (5) 
+0.01596«5—0.01161x7+----, 
where 
x=ap/(2me), 
: (6) 


c= velocity of light in vacuum. 


When ? is real, x=a/X. 


A 


pn fmm 








Fic. 1. Square wave-guide 
cavity antenna. 





Let the medium in the wave guide have the electrical 
constants p= yo and e=€,€9, where ¢, is real. Then, in 
terms of the dimensionless variables x, }/a, and ¢,, the 
eigenvalue equation [Eq. (3)] for the TE, mode 
becomes 


f (x)= — j(€0/mo)*(2xx)—'Ba cotBl+-G(x)+ j7B(x)=0, (7) 


®N. Marcuvitz, Waveguide Handbook (McGraw-Hill Book 
Company, Inc., New York, 1951), M.I.T. Radiation Laboratory 
Series, Vol. 10, Chap. 1. 

7 Cohen, Crowley, and Levis, The Aperture Admittance of a 
Rectangular Waveguide Radiating into Half-Space, Project Report 
339-22, Antenna Laboratory, The Ohio State University Re- 
search Foundation, Columbus, Ohio, November 1951. For an 
alternative treatment, see L. Lewin, Advanced Theory of Wave- 
guides (Iliffe and Sons, Ltd., London, 1951), p. 121. 
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Fic. 2. Relations among cavity parameters for first normal mode 
of square wave-guide cavity antenna. 


where 


B= — jy= (x/a) (4¢,2°—1)4. (8) 


A solution x, of Eq. (7) is obtained by expanding 
f (x) in a Taylor’s Series about the value xo, which is 
obtained when G=0. For the range of values under 
consideration the first two terms of the series give 
adequate accuracy.® Thus ~x, is given by 


—G(xon) 
0 f/OxXon 


The real part of x, is a@/A,, and the imaginary part 
gives the decay constant. The corresponding Q, is 


Xn=Xont 


(9) 


Wn a/Xn 
Gato wenmee, (10) 
2&, 2Im(xn) 





The calculations for the first mode indicate that 
when Q is high (Q>7) the following approximations 
are valid: 





a/An=A/Don (11) 
0 Wn OBr 

~ een ? 12 

2G dwn (12) 


where Br is the total admittance at the aperture. It 
is interesting to observe that Eq. (12) is the same as 
the approximate equation for Q obtained from circuit 
theory. 


8 The validity of keeping only the linear terms was checked by 
computing the second derivative at one point. The point chosen 
was a/A=0.60, ¢-=1; this is a point of very low Q and the ap- 
proximation should be worst here. The inclusion of the second 
derivative lowered Q by 3 percent and lowered a/d by 0.7 percent. 
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Fic. 3. Resonance characteristics of first normal mode of square 
wave-guide cavity antenna. 


Results for First Normal Mode 


A series of numerical calculations was made for the 
first normal mode of the square wave-guide cavity 
antenna, according to the above procedure. The 
parameters of interest are //a, e,, a/A, and Q. (The sub- 
script » has been dropped for simplicity.) Since it is 
often desired to design a cavity for a specific resonance 
frequency, a/d is chosen as an independent variable. 

Figure 2 shows the dependence of //a on a/X for 
various values of ¢,. It is seen that there is a variety of 
pairs of «, and //a which produce the same a/\. The 
individual curves asymptotically approach that value 
of a/X which corresponds to cutoff. 

The values of Q which result from the various con- 
figurations of Fig. 2 are shown in Fig. 3. The envelope 
of the family of curves in this graph corresponds to 
the minimum Q. Each value of a/d has a specific pair 
of values of ¢, and //a which are necessary to obtain 
the minimum Q. 

The minimum () was separately evaluated by maxi- 

mizing the denominator in Eq. (10) with respect to e¢, 
~ and J/a. The result is shown in Fig. 4, where the mini- 
mum ( is represented by the solid line. It is interesting 
to note that this is closely approximated by the formula 


QOmin=0.424(a/d)-, (13) 


H. COHEN 


which is represented by the dashed line in Fig. 4. The 
cubic dependence may be readily derived by consider- 
ing the aperture admittance to be small perturbation 
from an open circuit. The absolute values of Qmin so 
obtained, however, are somewhat smaller than those 
given by (13). 

Figures 5 and 6 give the values of //a and e, which 
produce the minimum Q. Note that these values are not 
critical, since Q is a relatively slow function of ¢€,, near 
the minimum (Q. 

Some typical numerical results for the first mode 
are shown in Table I. The first four columns include 
results which are represented in the graphs. The fifth 
column, §l, is the electrical length of the cavity at 
resonance. The columns labeled stored energy are ob- 
tained by assuming that 0B7/dw is proportional to the 
total stored energy of the system, and that it may be 
split in the same way as the susceptance, i.e., into ex- 
ternal and internal portions. This assumption is, of 
course, an approximation, which is valid for high-O 
antennas. It becomes true as Q increases and the fre- 
quency becomes real. It is seen from Table I that most 
of the stored energy is in the cavity. This was noticed in 
every calculation that was made for Q, including those 
for higher modes. 
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Fic. 4. Minimum Q versus a/d for first normal mode of square 
wave-guide cavity antenna. 


® Counter (reference 2) essentially used an open circuit boundary 
condition. He also obtained a cubic dependence for minimum Q. 
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Effect of Dielectric Loss 


Losses in the dielectric in effect simply add a term to 
the aperture conductance. When (Q is high in place of 
(12) we now have 


Wn OBr 
On=— 








(14) 


The dielectric loss does not effect the resonance fre- 
quency so long as Q remains high, but it may have an 
appreciable effect on Q itself. The additional term G 
may be found by letting ¢, be a complex number and 
expanding f(x,¢,) about the points xo, which were used 
previously. 

From Eq. (14) one may write 


1/O7=1/Qr+1/Qn, (15) 


where (7 is the total Q, Qz is the radiation Q given by 
Eq. (12), and Qp is the dielectric 0. When Qr is high 
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Fic. 5. Geometry necessary to obtain minimum Q. 


it may be shown that 


1/Op= tané, (16) 
where 
tané= loss factor=a/ (we). (17) 


Hence we have the approximate formula 
1/Or~1/Qrt+tané. (18) 
The radiation efficiency is Or/Qr; hence 
efficiency ~ (1+Q. tané)"". (19) 


This formula illustrates the principle that a high-Q 
system fundamentally has a low efficiency. 


CORRELATION WITH STEADY-STATE BEHAVIOR 


The problem now arises as to what these modes have 
to do with the steady-state antenna. An extensive litera- 
ture on this subject already exists,'*-' and it has been 


1 E. A. Guillemin, Communication Networks (John Wiley and 
Sons, Inc., New York, 1935), Vol. II, Chap. 5. 

1S. A. Schelkunoff, Proc. Inst. Radio Engrs. 32, 83 (1944). 

2 Montgomery, Dicke, and Purcell, Principles of Microwave 
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Fic. 6. Dielectric constant to obtain minimum Q. 


shown that when the antenna has a high Q it is approxi- 
mately represented by the network in Fig. 7. The RLC 
circuit is supposed to represent the cavity in both the 
steady-state and transient cases, and thereby the con- 
nection between normal modes and impedance of the 
antenna is made. Thus, the normal mode oscillation 
frequency and Q are equal to the steady-state resonance 


TABLE I. Typical results for first normal mode of 
square cavity antenna. 








Stored Stored 





energy* energy* 
in in 
a/r €r l/a Q Bl cavity half-space 
0.349 3 0.886 12.4 109.3° 0.0542 0.0083 
0.349 5.42 0.434 9.8 100.6° 0.0422 0.0083 
0.349 15 0.211 12.0 95.5° 0.0535 0.0083 


0.271 3.88 2.00 71.9 
0.125 25 0.935 336 
0.125 58.3 0.371 217> 
0.125 80 0.293 222 


134.1° 0.296 0.014 
126.2° 0.682 0.057 
108.7° 0.420 0.057 
105.4° 0.432 0.057 








® The numbers are approximately proportional to stored energy. 
» Minimum O for this (a/A). 





Circuits (McGraw-Hill Book Company, Inc., New York, 1948), 
M.I.T. Radiation Laboratory Series, Vol. 8, Chap. 7. 

13C. G. Montgomery, Technique of Microwave Measurements 
(McGraw-Hill Book Company, Inc., New York, 1947), M.I.T. 
Radiation Laboratory Series, Vol. 11, Chap. 5. 

4 J. C. Slater, Revs. Modern Phys. 18, 467 (1946). 

‘6 The Q used here is the so-called “unloaded Q” of the above 
references. 
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Fic. 7. Equivalent circuit near resonance. 





frequency and Q. The antenna ( is related to the im- 
pedance band width on the input transmission line in 
exactly the same fashion as the Q of the RLC circuit is 
related to the band width on the transmission line in 
Fig. 7. In this statement the assumption is made that 
the feed, i.e., the method of exciting the cavity, does 
not appreciably perturb the cavity fields from the 
normal mode fields. This point is discussed below. 
When the effect of the series reactance (Fig. 7) is 
negligible it may be shown that Q is simply related to 
the standing wave ratio on the transmission line. Let 


R_  ,SWR at resonance, if R/Z)>1 
90> = (20) 
Z, ‘'1/SWR at resonance, if R/Z9<1. 
Then 
Q=wn/ (w2—w1), (21) 


where w, is the resonance frequency, and w2 and w are 
the frequencies'® at which the SWR has risen to the 
value r;, where 


(1+ 4067+ 00)'+ (1+ 4007—o0)! 
(1+ 4oe-+o0)!— (14-400?) 





ry 


(22) 


Experimentally, the ambiguity in o» may be resolved by 
examining the impedance curve to determine whether 
R/Z, is greater or less than one at resonance.”—"4 

A slightly different value is obtained by including 
the series reactance. Under the assumptions that it is 
constant over a wide frequency range, and that the 
SWR far from resonance is what would be obtained at 
the resonance frequency, if the resonant circuit were 
removed, it is possible to show that 


[(1+00)?+ (1+-01)7}#+[(1+00)?— (1+01)*}} 
ro> ’ 
[ (1+00)?+ (1+601)*}}—[(1+00)?— (1+61)?}! 





(23) 


where rz is the standing wave ratio at w; and w2. The 
symbol a» is defined above in terms of SWR at reso- 
nance, and o; is 1/SWR far from resonance. Equations 
(22) and (23) give similar results when Q>1. 


Experimental Results 


A series of measurements were made on three square 
wave-guide cavity antennas. They had !/a=7.67, 2.00, 


16 This band width (w2—w;) can be found from a Smith Chart 
impedance curve of the antenna. The curve is simply rotated 
until it coincides (as closely as possible) with a constant resistance 
circle. The band-width is that portion of the curve which can then 
be renormalized into the 2.42 SWR circle. The value of 2.42 is the 
minimum value of 7. 


H. COHEN 


and 0.40; with ¢,=3.80, 3.88, and 8.41, respectively. 
The range of measurement covered 6 modes of the first 
antenna, 4 of the second, and 1 of the third. The reso- 
nance frequencies fell in the range 0.26< (a/A) <0.32. 

The resonance frequencies and Q’s were obtained 
from curves of SWR vs frequency, as described above. 
A typical curve is shown in Fig. 8. The TE, mode 
propagates above ¢/A=0.358, so apparently the asym- 
metry of the third and fourth modes is due to an inter- 
action between the TE, and the TE, wave-guide 
modes. (This interaction was prohibited in the third 
cavity by the use of an image plane which excluded the 
TE, mode.) 

Every resonance which was due to the TE, wave- 
guide mode alone was carefully measured, and the re- 
sults are given in Tables II, III, and IV. Two values of 
measured Q are shown in the tables. The value obtained 
from Eq. (22) is called Q,, and that from Eq. (23) is 
called Q2. There are also two values of calculated Q in 
the tables. Qe is the radiation Q, assuming only radia- 
tion losses, while Qr includes dielectric losses. The first 


40-——-- --- , ——— -, —-—— ,- —- — 
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Fic. 8. SWR for //a=2, ¢-=3.88, antenna, with long probe. 


two cavities were filled with sulfur for which the loss 
factor" is 0.0025. This factor is unknown for the medium 
in the third cavity, a lead-chloride mineral-oil mixture. 
The third cavity has a low Qe and therefore dielectric 
losses have relatively little effect. 


Effect of Feed 


The theory outlined above assumes that the wave 
guide is uniform; but as a practical matter the cavity 
has to be excited, and this is accomplished through a 
hole in the wall, with possibly some metal protrusions. 
In general this feed will distort the field and change the 
resonance frequency. The direction of frequency shift 
can be found from the general perturbation theory 
of resonant cavities. According to this theory a metal 
protrusion in a region of high E and low H should 
lower the resonance frequency, whereas one in a region 


7 A. Von Hippel, Tables of Dielectric Materials, Laboratory of 
Insulation Research, Massachusetts Institute of Technology, 
Cambridge, Massachusetts, June 19-48. 
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of low £ and high H should raise it. This effect can be 
seen in the tables. 

The first antenna was excited by a loop in the rear 
wall. The loop was much smaller than the size required 
to produce a match at resonance and hence caused a 
small perturbation. The calculated and measured reso- 
nance frequencies agree almost exactly. The second and 
third antennas were excited by probes close to the 
aperture, in a region of high electric field. It was found 
that a match could easily be obtained by varying the 
probe length, and in Tables III and IV the long probe 
is the one which gives a match. The long probe caused 
a considerable perturbation which lowered the reso- 
nance frequency by as much as 8 percent. The resonance 
frequencies obtained with short probes were closer to 
the computed values, as expected. 


Discussion of Q 


The validity of the various assumptions which enter 
into the calculation are of more consequence to Q than 
to resonance frequency, essentially because a non- 


TABLE IT. Results for !/a=7.67, ¢-= 3.80, loop-fed, cavity antenna. 

















Meas- Calcu- Meas- Meas- Calcu- Calcu- 
ured lated ured ured lated lated 
Mode a/r a/d Qi Oz Or Or 
18 tee 0.258 tee ‘* 1470 344 
2 0.263 0.264 104 134 430 220 
3 0.273 0.273 89.6 114 240 156 
4 0.285 0.285 82.5 97.7 172 124 
5 0.301 0.300 76.9 90.4 137 105 
6 0.319 0.318 67.1 75.6 119 93.4 
® This mode was not observed. 


infinite Q is a second-order effect. For example, in 
many cases the dielectric loss has to be included to ob- 
tain a meaningful Q, whereas its effect on resonance 
frequency is negligible. In some cases the metal losses 
also might be significant, but they were ignored in the 
present calculation. The measured Q values are gen- 
erally less than the calculated ones, and metal losses are 
at least partly responsible for this. 

The assumptions involved in the use of an equivalent 
circuit to find Q should be good when Q is high and the 
mode is isolated, i.e., far from its neighbors. The Q»2 
values are higher than the Q, values and thus are closer 
to Or (except for the one case in Table ITI), presumably 


TABLE III. Results for first two normal modes of 
l/a=2.00, e-=3.88, prove-fed cavity antenna. 








Meas- Calcu- Meas, Meas- Calcu- Calcu- 
ured lated ured ured lated lated 
Mode Probe a/r a/r Or Oz Or Qr 


long 0.265 0.271 396 410 719 61.9 
short 0.269 0.271 35.2 418 719 61.9 
long 0.317 0326 30.7 324 32.7 30.4 
short 0.324 0.326 22.3 260 32.7 30.4 





NN ee 








TABLE IV. Results for first normal mode of //a=0.40, e-=8.41, 
probe-fed, cavity antenna, with image plane. 











Measured Calculated Measured Measured Calculated 
Probe a/d a/r Or Oz Or 
Long 0.280 0.305 13.7 15.0 15.8 
Short 0.289 0.305 11.8 14.4 15.8 








because they are derived from a more elegant equivalent 
circuit. 

The agreement obtained between measured and cal- 
culated Q is reasonable under these circumstances. 


CONCLUSIONS 


The normal modes of cavity antennas have been dis- 
cussed and it has been shown how calculations may 
easily be made for them when the antenna is essentially 
a wave-guide cavity, and the aperture admittance is 
known. The experiments indicate that the theory repre- 
sents the major resonance effects. 

It has been demonstrated that a square wave-guide 
cavity antenna has a minimum Q, which is very high 
when a/d is small. In view of the well-known fact that 
small-aperture antennas invariably have a high Q, it 
is felt that the minimum Q curves give the order of mag- 
nitude of the best that can be done with other shapes 
of cavities and other loading schemes. The minimum Q 
of a cavity antenna is thus determined primarily by the 
major aperture dimensions. 
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When a shock is incident upon a corner from either the concave or convex side it is reflected by the walls 
forming the corner and diffracted at the edge. The mathematical problem corresponding to this phenomenon 
has not been solved analytically except in the following limiting two-dimensional cases: a weak (acoustic) 
shock incident on any corner (Keller and Blank) and a finite shock incident either normal or parallel to a 
nearly flat wall (Bargmann, Lighthill, Ting, and Ludloff). In this article it is pointed out that there are 
certain special cases of a finite shock entering a corner from the concave side in which no diffraction occurs. 
In these cases it is therefore possible to solve the entire problem by using the theory of regular reflection, 
which yields an explicit solution by algebraic means alone. 


I. INTRODUCTION 


HEN a shock enters a corner it is reflected one or 
more times from the walls forming the corner. 
Finally it reaches the corner and then, generally, it is 
diffracted. It is the purpose of the present article to 
point out that in special cases no such diffraction occurs. 
In these cases it is therefore possible to solve the entire 
reflection problem exactly and explicitly by algebraic 
means only. 

To exemplify this result the solutions are found for 
several particular cases. These solutions show that the 
pressure on the walls increases tremendously because of 
the multiple reflections. It therefore seems that such 
corners are very susceptible to damage from explosion 
shocks, and should be avoided in designing structures 
likely to be exposed to such shocks. 

The fact that solutions without diffraction can occur 
is indicated by the results for acoustic pulses entering 
corners. These results are given in “Diffraction and 
Reflection of Pulses by Wedges and Corners” by A. 
Blank and the present author [Communs. Pure and 
Appl. Math. IV, No. 1 (June, 1951) ]. In that article it is 
shown that in the two-dimensional case with plane 
pulses incident theré are certain corner angles x/2n (n 
any integer) for which no diffraction occurs. This result 
holds for any angle of incidence of the acoustic shock, 
and these angles are just those for which every optical 
ray is reflected back parallel to itself. 

It is therefore natural to seek corresponding cases in 








(a) (b) 


Fic. 1. Acoustic pulse symmetrically incident in a right-angled 
corner. The numbers indicate excess pressure in units of the 
incident excess pressure. (a) Before shock reaches corner. (b) After 
shock reaches corner. 

*On leave of absence during 1953-4 at the Office of Nava! 
Research, Washington, D. C. 


which finite shocks are reflected without diffraction. 
Just as in the acoustic problem, such cases arise when 
that portion of the shock front which suffers its final 
reflection from one wall emerges parallel to that portion 
which suffers its final reflection from the other wall. If 
such cases occur, and if all reflections are regular, the 
flow field will be piecewise constant and obtainable by 
algebraic calculations, i.e., by calculating a finite 
number of regular reflections. 


Il. THE RIGHT-ANGLED CORNER 


In order to construct an example of such a case, let 
us begin by examining the acoustic solution for a plane 
shock incident symmetrically in a right-angled corner 
with rigid walls. The solution at two typical instants 
is shown in Fig. 1, the numbers indicating excess pres- 
sure in units of the excess pressure of the incident 
shock. This solution is given by Courant and Friedrichs,! 
and it is almost obvious. 

Let us seek a finite shock solution in which the shock 
configuration is the same as that of Fig. 1, but in which 
the pressure values are necessarily different. If we 
indicate the state of the gas in each region in Fig. 1 by 
one of the numbers 0, 1, 2, 3, 4, then we may designate 
the corresponding pressures by po, 1, p2, ps, pa. In 
Fig. 1 the angle of incidence of each shock with respect 
to the wall from which it is reflected is 7/4 (=45°) and 
each angle of reflection is also 7/4 (=45°). Now in 
general the angles of incidence and reflection are un- 
equal in finite shock reflection. However for one partic- 
ular angle a in each gas they are equal. In a polytropic 
gas with adiabatic exponent y this angle a is given by 
(reference 1, p. 327) 

y—1 
cos2a=——. (1) 


If a=45°, as in Fig. 1, then y=1. Thus the configura- 
tion of Fig. 1 is possible with an incident finite shock of 
any strength provided y=1. 

To find the pressures, we assume that fo and f; are 
given. Then p2 is determined by the conditions at a 


1R. Courant and K. O. Freidrichs, Supersonic Flow and Shock 
Waves (Interscience Publishers, Inc., New York, 1948). 
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regular reflection which yield when y=1 (reference 1, 
pp..331, 327) 
pr 
p2=—fi. (2) 


Po 


To find p3; we note that it is the pressure resulting from 
the head-on collision of two shocks and this is equivalent 
to a regular normal reflection. Thus 








(p2)>  /bi\? 
p3= = ~) pi. (3) 
pi Po 
Finally, ps results from another regular reflection so 
(p3)”  /hi\? 
pa= ={ — } pi. (4) 
pe Po 


All other quantities can be found from the formulas in 
reference 1, p. 330. The preceding results also apply to 
the corner of angle 2/4 obtained by bisecting Fig. 1. 








(a) (b) 


Fic. 2. Shock configurations which may result from a finite 
shock symmetrically incident in a corner. Cases in which this 
occurs are shown in Fig. 4. The numbers denote the regions. (a) 
Before shock reaches corner. (b) After shock reaches corner. 


It is interesting to note that p, is greater than p; by 
the factor (p:/0)® which equals 1000 if 1/p0= 10. Thus 
ps may become very large. This fact must be approxi- 
mately true for other values of y and other corner 
angles as well. 

For other values of y the configuration in Fig. 1 is not 
possible, even with any other corner angle. This is so 
because in Fig. 1 the angles of incidence and reflection 
are complementary for any corner angle. Furthermore 
the angle of reflection in Fig. 1(a) is the angle of inci- 
dence in Fig. 1(b). But when the angles of incidence and 
reflection are complementary, the latter must always 
exceed the former (see, e.g., reference 1, p. 328, Fig. 55). 
This cannot apply in both cases unless the angles of inci- 
dence and reflection are equal, which was the case 
treated above. 

Thus to obtain solutions for y#1 we must consider 
configurations such as those in Figs. 2 and 3 rather than 
Fig. 1. To analyze either of these cases we must specify 
the corner angle, the state 0, the state 1, and the 
direction of incidence, which is along the angle bisector 





(a) (b) 


Fic. 3. Shock configurations which may result from a finite 
shock symmetrically incident in a corner. No cases in which this 
occurs have been found in this paper. (a) Before shock reaches 
corner. (b) After shock reaches corner. 


in the figures. Then we must determine state 2 by 
regular reflection, state 3 by regular reflection from state 
2, and state 4 by regular reflection from state 3. This 
analysis will yield a solution provided that the last 
shock between states 3 and 4 is perpendicular to the 
bisector of the corner angle (for symmetric incidence), 
since only then will the two halves of this shock form a 
straight line. Thus the specified data must be adjusted 
until this last condition is satisfied. 

If the gas, the state 0, and the direction of incidence 
are fixed, then the incident shock strength and the cor- 
ner angle remain adjustable. It seems that these two 
variables can always be adjusted so as to yield an ex- 
plicit solution of the type considered above. In fact for 
either of the variables fixed in an appropriate range, it 
seems that a solution can be found by adjusting the 
other. Therefore in the plane in which the incident 
pressure ratio p:/po and the corner angle @ are coordi- 
nates, there is a curve corresponding to diffractionless 
reflection. As p;/o tends to unity, this curve must tend 
to 0=7/2 for any y and any direction of incidence, as 
the acoustic solution shows. For symmetric incidence 
and y= 1 this curve is just the vertical line @=2/2. For 
symmetric incidence and y# 1 the curve is shown in Fig. 
4 starting at the acoustic point p:/po=1, 0=2/2. When 
y>1, then @>/2 as the curve shows. Of course there 
are undoubtedly other branches of this curve corres- 
ponding to the other angles 7/2n which occur in the 
acoustic case. The method by which this curve was 
obtained is described in the next section. 


III. CALCULATION FOR y#1 


In order to find the curve in the (fo0/p1—8)-plane 
which gives diffractionless reflection, we restrict our 
attention to symmetric incidence and the configuration 
shown in Fig. 2. We assume that the incident pressure 
ratio &") = p/p,=1+-7 where 7» is small and negative. 
We also assume that 0= (x/2)—an?+--- where a is to be 
determined. This form of @ was decided upon after 
examining the equations. Now we follow reference 1, 
pp. 329-330, setting to =tanao™, to“ = tana,"?. Three 
superscripts 1, 2, 3 will be used, referring to the first, 
second, or third reflection, respectively. Then we have 
(reference 1, p. 330, Eqs. 127.04, 127.08) with (y—1)/ 
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(y+1) =p" 
(1—é)lo (¢—1)te 
(ims Ob 
1+ wt= (E+u*) (to)? Ime + (E+?) (t2)” 


M?(1—y?)?(to— t2)-+ M { (1—?)*— (to— te)” 
— (u?+ tote)?} — (to— tz) =0. (6) 


With the chosen values of ¢ and 6 we find ag" = (2/2) 
— (6/2) from the figure, compute M“ from (5), solve 
(6) for t2%, then solve (5) for ¢% = p2/p1. Now ao 
=a2")— (6/2) and to° =tanay® so we compute M® 
from (5) using §® =1/¢™. Then we solve (6) for 
and (5) for ¢®. Again from the figure ap = (6/2) —a2”, 
to =tanay™. Then we compute M® from (5) with 
£%)=1/¢), solve (6) for t2 and (5) for ¢®. Finally we 
require that a2®=ao™ or t="). This condition 
determines @ in terms of ®), or in the perturbation 
problems it yields the constant a. The results of this 
calculation are, to second order in 7, 


First Reflection 




















EV=1+n : (7) 
TT 
¢=-—ar (8) 
2 
to =1+<ar? (9) 
—n n° 
M) = ——___4 - (10) 
2(i+y?) 4(1+4y7) 
2u*n Su? 
££0=1— +] a+ fr (11) 
(1+?) 1+ 4? 
(Y= 1—H+7’. (12) 
Second Reflection 
wn uw? (3+?) 
to) = ~<A + [op (13) 
1+? 2(1+ 4)? 
EO =1+7+0-7? (14) 
ben? 
M®) = ——_ (15) 
(1+y*)? 
un pu? (1+5y*) 
22) = — +[o4Q Ny (16) 
1+ pu? 2(1+u?)? 
It+y? 14+3y 
ae a 
uw 2(1+2’) 
Third Reflection 
2u? uw? (1+3y?) 
th = 1+ ——4-[ 34 |r (18) 
1+y? (1+-y")? 
1+py?Sp?+3 
g@ = ++ a+ hr (19) 
we 2 (u?+ 1) 











KELLER 
n a 2yu?+1 
a |-<+5——r (20) 
2(1+y?) bly? 2(1+4n°)? 

Out Sp? 
{= 1+ 0-n-| 50+- n (21) 

(1+y°)* 

. +e S++ 
(P= -»-| a+ hr (22) 

we 2(1+-u’) 


In order that f2% =)" we find that a in Eq. (8) 
must be given by 


Out+Su? = (1—7) (77-2) 
6(1+n?)? 12y2 0 





(23) 


We see that a=0 at y=1, 2/7, is positive between these 
values and negative outside them. Since usually y>1, 
we see that @ must be increased above 7/2 as the incident 
shock strength is increased, in order to yield reflection 
without diffraction. We note that if y=1, a=0 and 6 
need not change, in agreement with the result of the 
previous section for y=1. For y=1.4 we find a= —13/ 
296. Some of these results are shown in Fig. 4. 





2+ REGION OF NO REGULAR 
REFLECTION 


wc eee eeenoe=-+ 











120 140 160 180 
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Fic. 4. Curves showing conditions in which reflection without 
diffraction occurs as shown in Figs. 1 and 2. The pressure ratio 
across the incident shock is po/ 1, the corner angle is @, and the 
shock is incident symmetrically. The acoustic points po/pi=1, 
6=7/2n with n any integer were found previously (see Sec. I). 
The line @=* corresponds to normal reflection. The dashed line 
6=2/2 (——-) is one branch of the curve for y=1 and was deter- 
mined in Sec. II. The solid curve starting at po/fi=1,0=2/2 was 
computed from Eq. (8) with the constant a= — 13/296 calculated 
from Eq. (23) using y= 1.4 (air). The curve separating the region 
of regular reflection from that in which no regular reflection occurs 
is given by the equation Oext= 7—2aext Where aext is taken from 
[1, p. 326, Fig. 54] and is computed for y= 1.4 (air). The dotted 
branches of the curves are sketched in, but have not been com- 
puted. 


Finally we note that the computational procedure 
outlined above can also be used for shocks that are not 
nearly acoustic. In that case an incident shock strength 
must be chosen as well as a definite corner angle. If 
these choices do not yield a solution one of them (e.g., 
the angle) must be changed and the calculation repeated 
until a solution is obtained. However since the range of 
shocks for which reflection without diffraction occurs is 
limited to shocks of small strength, as is shown in Fig. 
4, the preceding results should be adequate, and further 
calculation seems unnecessary. 
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Measurement of the Effect of Chlorine Treatment on the Work Function 
of Titanium and Zirconium* 


HERBERT MALAMUD AND AARON D. KRUMBEIN 
Physics Department, University of Maryland, College Park, Maryland 
(Received August 20, 1953) 


This paper describes work function measurements of titanium and zirconium carried out by the con- 
ventional photoelectric method. Two samples of each metal were investigated before and after treatment 
with chlorine. The changes in work function due to chlorine absorption were of the same order of magnitude 


as those reported for other halogens on other metals. 





INTRODUCTION 


ORK function measurements were made by the 

photoelectric method! on two samples each of 
titanium and zirconium before and after treatment with 
chlorine. It is well known that most gases occluded on 
a metal surface generally lower the work function (@) 
while a few gases, notably the halogens, raise the work 
function. The present work was done as part of a re- 
search project on halogen counters to determine 
whether the metals under test would be suitable as 
counter cathode materials.2 The metal samples and 
their analysis were supplied by the United States 
Bureau of Mines at College Park, Maryland. The 
titanium was commercially pure, arc-melted metal 
with 0.04-0.06 percent carbon, less than 0.03 percent 
nitrogen, and less than 0.1 percent oxygen. The zir- 
conium used was “hafnium-containing, induction 
melted,” with somewhat more than 0.2 percent carbon. 


APPARATUS 


The experimental arrangement was straightforward. 
An ultraviolet lamp (G.E. Type CH-3) illuminated 
the input slit of a small Gaertner quartz prism spectrom- 
eter which had been fitted with a movable slit (in 
place of the film plate), and a scale, calibrated for the 
identification of the various ultraviolet spectral lines. 

Light from each spectral line in turn selected by this 
monochromator was allowed to enter the quartz 
window of a Pyrex testing tube, and illuminate the 
cylindrical cathode being tested. The anode was a 
tungsten wire coaxial with the cathode. 

The tube was supported in a brass case with the 
quartz end window projecting through a hole punched 
in a sheet of rubber forming one end of the case. The 
inside of the case contained the first stage of a dc 
amplifier and suitable drying agents. The circuit of 
the amplifier was identical with the Victoreen 247B 
radiation survey meter circuit except that our testing 
tube was substituted for the ionization chamber. A 


* This work is supported by the Bureau of Ships under Contract 
NOBsr-49066. 

1A. L. Hughes and L. A. Dubridge, Photoelectric Phenomena 
(McGraw-Hill Book Company, Inc., New York, 1932), pp. 
419-456. 

2D. H. Wilkinson, Jonization Chambers and Counters (Cam- 
bridge University Press, Cambridge, 1950), pp. 44-45. 


Leeds and Northrup Type R galvanometer served as 
the indicator. Figure 1 shows a block diagram of the 
apparatus. 


PROCEDURE AND RESULTS 


Each sample was tested exactly as received and 
again after degassing. The first degassing process was 
accomplished by heating the sample to bright red 
heat (~900°C) in vacuum by means of a radio fre- 
quency heater for about ten minutes. The vacuum 
system consisted of a two-stage water-cooled DPI oil 
diffusion pump, with a dry ice-alcohol trap between 
the pump and the sample. A pressure of less than 10 
cm Hg (the limit of the thermocouple gauge used), 
was obtained even while the degassing was in progress. 
The metal samples were then further degassed and 
after cooling, were exposed to dry chlorine gas at a 
pressure of 10 mm Hg for the first sample of each metal, 
and 20 mm Hg for the second sample. ‘Lecture bottle” 
chlorine gas, dried over phosphorus pentoxide, was 
used. The metals were then allowed to “pickle” in the 
chlorine for about eighteen hours. Measurements were 
made after reevacuation at the end of that time. 

The final treatment, which we call “heat pickling,” 
was accomplished by admitting the chlorine while the 
metal was at red heat immediately after degassing. 
The metal was then allowed to cool in the presence of 
the chlorine. The first samples were so treated with 
chlorine at a pressure of about 4 mm of mercury and 
almost all of it was absorbed by the metal. The second 
samples were treated with chlorine at a pressure of 


TABLE I. Work function of titanium and zirconium metals 
before and after treatment with chlorine. 

















Samples No. 1 Zirconium Titanium 
Sample as received 3.10+0.07 ev 3.10+0.07 ev 
Degassed 4.3340.07 4.45+0.05 
Pickled 4.76+0.11 4.76+0.11 
Heat pickled 4.72+0.07 5.54+0.06 

Samples No. 2 Zirconium Titanium 
Sample as received 3.8140.12 ev 4.36+0.04 ev 
Degassed 4.33+0.07 4.45+0.05 
Pickled 4.7640.11 4.76+0.11 
Heat pickled 4.72+0.07 5.27+0.09 
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Fic. 1. Block diagram of the apparatus. 


about 20 mm of mercury of which about half was 
absorbed. The second titanium sample was not pre- 
heated to as high a temperature and subsequently did 
not absorb the chlorine to the same extent as did the 
other sample. Its work function, as will be seen in 
Table I, was substantially lower than that of the first 
titanium sample. 

Each of these measurements was performed at least 
twice before the results were accepted. Although a 
period of several months often elapsed between tests 
of samples 1 and 2, reproducible results were obtained 
as can be seen in Table I. 


DISCUSSION OF RESULTS 


The. limits of accuracy of the measurements were 
set by the spacing of the mercury spectral lines. Also, 
we believe that each process of treatment was carried 
to its limit. For example, in each degassing process, 
the sample was heated until some of the cathode metal 
was evaporated onto the glass containers. Further, it 
was noted that although different amounts of chlorine 
were used for each sample in the pickling processes, the 
values of ¢ are the same, indicating that most probably 
the same amount of chlorine was adsorbed in each 
case. 





Ouellet and Rideal,? measuring work function by 
using the metal in question as the cathode of a Geiger 
tube, found that the shift in ¢ due to a gas is propor- 
tional to the amount of gas adsorbed. They measured 
the effect of up to 0.2 mm Hg of iodine and bromine 
vapor on the work function of gold, silver, and tungsten 
Geiger tube cathodes. The maximum changes in work 
function, found by them to be due to halogen gases 
range from 0.21 ev to 0.40 ev. Our own results for the 
pickling process, when more than sufficient chlorine 
was present for saturation, according to the standards 
of Ouellet and Rideal, gave us, for admittedly different 
metals and a different halogen, the value range 0.47 ev 
<A¢@<0.61 ev. These are the maximum values. 

The work functions of the degassed metals may be 
compared with the values obtained by Rentschler and 
Henry.‘ They found, for titanium, a value of ¢ of 4.17 
ev and for zirconium, a value of ¢ of 4.0 ev. They made 
their tests on pure titanium and pure zirconium de- 
posited by sputtering on thin wires. This may explain 
the fact that their values of the work function were 
slightly lower than our own. 


3G. Ouellet and E. K. Rideal, J. Chem. Phys. 3, 150 (1935). 
‘H. C. Rentschler and D. E. Henry, Trans. Electrochem. Soc. 
87, 289 (1945). 
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Delayed Yield and Strain Rate and Temperature Dependence of Yield Point in Iron 
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A unified explanation of delayed yield and of strain rate and temperature dependence of yield point in iron 
was attempted on the basis of the Cottrell locking theory, taking internal stress into account. The energy of a 
dislocation required to fit existing data is small by an order of magnitude compared with the value theoreti- 
cally estimated by Mott and Nabarro. The distance, calculated from the value of internal stress, between 
dislocations assembled at a grain boundary is of a reasonable order of magnitude. It is suggested that the 
mechanism of yielding may change from Cottrell type to Mott-Nabarro or Frank-Read type when applied 


stress falls below some limiting value. 





I, DELAYED YIELD 


OOD and Clark’ carried out experiments upon 

delayed yield, that -is, the phenomena of the 
time delay of yielding under constant stress in mild 
steel. In this paper it is attempted to explain their 
measurements in terms of the present theory of dis- 
locations. On the basis of the dislocation locking theory 
developed by Cottrell,? the mean delay time 7 for 
yielding is a reasonably good approximation’ given by 
the form 


r=1/m=7(a/a9)~/"*??, (1) 
where 


m=the rate of breaking away of dislocation, i.e., 
mdr is the probability per time interval, dr that 
a dislocation breaks away. 

1 /to= Zy. 

v=the vibration frequency of a dislocation. 

Z=the number of dislocations for breaking away in 
the region of stress concentration such as 
shoulders of a specimen. 

o=applied constant tensile stress. 

ao= tensile yield stress at absolute zero temperature. 

1/n=the constant‘ with the dimension of energy which 
is a function of A, W, p and X. The definitions of 
A, W, p and X are the same as in the paper of 
Cottrell and Bilby® (C-B). 
k=the Boltzmann’s constant. 
T=absolute temperature. 


If we take account of internal stress,® o;,’ arising from 
the presence of other nearby dislocations assembled at a 
grain boundary as an array of dislocations, Eq. (1) can 


( 1D. S. Wood and D. S. Clark, Trans. Am. Soc. Metals 43, 571 
1951). 

2A. H. Cottrell, Report of a Conference on the Strength of 
Solids, Physical Society (London), p. 30 (1948). 

3T. Yokobori, Phys. Rev. 88, 1423 (1952). 

41/n is not exactly expressed by a(AWp)! in the case where W 
or a/ao is very small. 

5 A. H. Cottrell and B. A. Bilby, Proc. Phys. Soc. (London) 62, 
19 (1949). 

6A. H. Cottrell, Symposium on the Plastic Deformation of 
Crystalline Solids, U. S. Office of Naval Research, p. 60 (1950). 

7 @; is the mean value of the internal stress corresponding to the 
position x; of stable equilibrium and x2 of the unstable one in 
the C-B paper (see reference 5). 


be written as 
T= 79(o—0;/a0—0;) "7, (2) 


where o—o,;=o,=applied constant tensile stress for 
breaking away dislocations anchored by Cottrell lock- 
ing, and o9—o;=00,.= break-away stress by Cottrell at 
absolute zero temperature. Assuming A=5X10-*! 
dyne-cm’, W=0.1 ev per atom plane, p=2X10-* cm, 
1/ro=1.3X10° sec, oo-=158 kg/mm’, and o;=18.8 
kg/mm”, the theoretical dependence of delay time upon 
stress and temperature is in good agreement® with the 
experiments of Wood and Clark! as shown in Fig. 1. 


II. STRAIN RATE AND TEMPERATURE 
DEPENDENCE OF YIELD POINT 
Yield stress under constant strain rate é is also given 
as a good approximation,*® on the basis of the same 
consideration given to delayed yield, 


o—¢0;= (oo—o,) (€Ero/nkToo)"*7, (3) 


where o=the observed upper (or sharply flowing) 
tensile yield stress and E is Young’s modulus. The data 
referred to below are assumed to satisfy this condition 
of constant strain rate. It can be seen from curve I in 
Fig. 2 that the theory is roughly in agreement with the 
experiments of Wood and Clark! when the same values 
of A, W, p, 70, goc, and o; as in Sec. I are used and €¢ is 
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Fic. 1. Dependence of delay time for yielding upon stress and 
temperature. Solid curves are calculated by Eq. (2). From data of 
Wood and Clark (see reference 1). 


8 The data at 121°C are too few to be compared with the theory. 
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Fic. 2. Dependence of yield point upon temperature. Solid 
curves are calculated by Eq. (3). Constants used are shown in 
Table I. 


assumed to be 3.6X10~-° sec~. It can also be under- 
stood from curves II, III, and IV in Fig. 2 that other 
data*" available from literature with iron and mild 
steel are in good accordance with the theory when 
different values of oo-, and o; from, but same values of 


A, W, po, ro, and € as those in curve I are assumed 
(Table I). 


III. DISCUSSION 


In this paper a single break-away process was as- 
sumed to be the rate-determining event in the initiation 
of yielding. 

It is noted that the energy of a dislocation required to 
fit existing data is small by an order of magnitude com- 
pared with the value estimated theoretically by Mott 
and Nabarro.” 


TABLE I. Values of constants in Eqs. (2) and (3), stress concen- 
tration factor g, and dislocation distance “a.” 


( A=5 X10" dyne cm?, 


W =0.1 ev per atom plane, ) 
p=2X10-* cm, 


1/ro=1.3X10° sec", €=3.6X10-5 sec. 











te = O96 — Ci Cv a 
Investigators kg/mm? kg/mm? q 10-* cm 
Wood and Clark 158 18.8 1.7 0.39 
McAdam and Mebs 151 8.2 1.8 0.58 
Eldin and Collins 178 11.0 ‘3 0.55 
Cottrell and Churchman 90 0 3.0 


(single crystal) 











*D. J. McAdam, Jr. and R. W. Mebs, Proc. Am. Soc. Testing 
Materials 43, 661 (1943). 

A. E. Eldin and S. C. Collins, J. Appl. Phys. 22, 1296 (1951). 

" A. H. Cottrell and A. T. Churchman, Nature 167, 943 (1951). 

2 N. F. Mott and F. R. N. Nabarro, Report of a Conference on 
the Strength of Solids, Physical Society (London), p. 1. (1948). 


If the region of stress concentration involves only the 
surface area of shoulders of a specimen, say of the order 
of 1 cm’, then Z is assumed to be about 108 in annealed 
state. From the value of 1/7 fitted to existing data, we 
find v~10 sec™', which is very much smaller than that 
estimated by Mott and Nabarro.” 

The theoretical break-away stress, o,0, for anchored 
dislocation at absolute zero temperature becomes 270 
kg/mm?, when calculated from 3V3A/4\°p?, which is 
greater than the value of c—o;(=o0-) extrapolated to 
0°K. This may be due to stress concentration, a factor 
of which is given by g=a0/aoe (Table 1). 

Assuming that the dislocation which is attempting to 
break away is a member of an infinite wall of anchored 
dislocations with the same sign, a distance ‘‘a’”’ between 
dislocations can be calculated* from go;=GdrAx/ 
2x(1—v)9a?. The value of “a” calculated is shown in 
Table I, which is slightly smaller than 10~* cm but of a 
reasonable order of magnitude. It can be seen from this 
analysis that g is larger and o; is far smaller (almost 
negligible) in single crystal than in polycrystal. The 
negligible order of o, in single crystal means that grain 
boundary acts as an obstacle. 

The calculated ratio of the activation energy to kT, 
i.e., U/kT is found to be about 23, which is smaller 
than 50 and of a reasonable order. 

When applied stress falls below some limit, the mecha- 
nism of yielding is expected to change from Cottrell 
type to Mott-Nabarro type in the presence of internal 
stress, or to Frank-Read type. The discrepancy be- 
tween the experiments and the theory as shown in 
Fig. 1 seems rather to be the result of this transition 
than the result of rough approximation in the range of 
very small o—o;/ao—¢;. It seems that because of the 
above transition the yield stress becomes almost inde- 
pendent of strain rate’® when the latter becomes slower, 
that is, stress becomes smaller than some limiting value. 
It also seems to be related to this transition that in the 
experiments of Kramer and Maddin" with 6 brass, the 
stress for initiation of plastic flow does not appear to 
be temperature and time dependent in the range of 
small applied stress. Seitz'® attempted to give these 
related phenomena another interpretation. 

On the basis of the suggestion put forward by 
Nabarro,!® the dependence of the delay time upon 
applied stress and temperature seems difficult to discuss 
quantitatively in his original idea. 

The author wishes to express his sincere thanks to 
Professor D. S. Clark and Professor A. H. Cottrell for 
sending him reprints of their papers. This work was 
carried out by the financial aids of the Ministry of 
Education of Japan. 


3 E. A. Davis, Trans. Am. Soc. Mech. Engrs.(J. Appl. Mech.) 
A137, 60 (1938). 

“JT. R. Kramer and R. Maddin, J. Metals 4, 197 (1952). 

15 F. Seitz, Phys. Rev. 88, 722 (1952). 

‘6 F. R. N. Nabarro, Report of a Conference on the Strength 
of Solids, Physical Society (London), p. 38 (1948). 
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Negentropy and Information in Telecommunications, Writing, and Reading 


L. BRILLOUIN 
I. B. M. Watson Laboratory, New York, New York 


(Received September 8, 1953) 


The negentropy principle of information was discussed in a previous paper and applied to a variety of 
problems of scientific observation. The present paper considers the technical problems of telecommunication. 
Using a thermodynamical discussion one obtains the Shannon formula for the capacity of a channel with 
noise. This example proves again the connection between information and negentropy. 

The problem of writing and reading is examined. Here we have a type of “‘dead information” which must 
be re-energized for reading: The source of light needed for reading provides the negentropy which is changed 


into information by the reader. 


The negentropy connected with “organization” is also discussed. All these examples show the connec- 
tion between information and negentropy. They are of great importance for a theory of information, although 
the entropy amounts involved may be completely negligible. 





1. THE NEGENTROPY PRINCIPLE OF INFORMATION 


HE statistical definition of information was dis- 

cussed in a previous paper.' The problem of in- 
formation connected with a physical system was 
investigated and it was found that “information,” in 
such a case, always corresponds to a negative term in 
the total entropy of the system (negentropy). 

Special emphasis was laid on the conditions of ob- 
servation in the physical sciences. In a variety of 
examples it was proved that an experiment, by which 
an information is obtained about a physical system, 
corresponds in average to an increase of entropy in the 
system or in its surrounding. An information must 
always be paid for in negentropy, the decrease of 
negentropy being larger than (or equal to) the amount 
of information obtained. 

A generalized Carnot’s principle was stated: 

“The sum of negentropy plus information must 
always decrease.” 

The present paper will discuss some applications of 
these general ideas to some practical questions of every- 
day life: telecommunication, speech, writing, and 
reading. It will be proved, in the first examples, that the 
negentropy principle of information leads directly to 
the well-known formula of C. E. Shannon for the 
capacity of a communication channel. The problem of 
communication through writing and reading requires 
a special discussion that will bring forward some new 
points of real importance.” 

The problems investigated in the previous paper 
corresponded to a variety of examples of transforma- 
tions of the general character 


negentropy—information. (1) 


The telecommunication channel presents a typical case 
of the inverse transformation 


information—negentropy (2) 


1L. Brillouin, J. Appl. Phys. 24, 1152 (1953). 

2 L. Brillouin, Am. Scientist 38, 594 (1950); see also a paper in 
the book L. de Broglie, Physicien et Penseur (Albin Michel, Paris, 
1953), p. 359. 


at the transmitting end. On the contrary, the receiving 
end changes negentropy back into information. 

Writing and reading provide examples of both trans- 
formations (1) and (2). 


2. TELECOMMUNICATION ON A CHANNEL WITH 
NOISE 


The problem of the telecommunication channel will 
be discussed along the lines of a paper of Raymond* 
and some results of Felker and Bell* will be obtained. 
We start with Shannon’s fundamental formula for the 
capacity of a channel with noise.’ We consider, for 
instance, a cable which can transmit all the frequencies 
from 0 up to a maximum vy. The cable is at a tempera- 
ture T and consequently exhibits a noise level corres- 
ponding to a certain average power P,. When the band 
width vy and the direction of propagation of the signals 
are given, we obtain, at a temperature T 


P,= kTvm. (3) 


This results directly from the classical discussion of 
Nyquist about the Johnson noise.*® 

At the transmitting end, a transmitter emits signals 
with an average power P. The information to be trans- 
mitted along the cable is translated into a system of 
pulses or in a variety of frequencies which travel along 
the cable. Shannon proves that the maximum capacity 
of the channel is given by the formula 


P 
C=Kvy in( 1+—). (4) 
P, 


If we choose K = logse, the capacity is given in “bits per 


3R. C. Raymond, Am. Scientist 38, 289 (1950). This paper 
contains a wrong coefficient 4 in some formulas. 

4J. A. Felker, Proc. Inst. Radio Engrs. 40, 728 (1952); D. A. 
Bell, Am. Scientist, 40, 682 (1952). 

5C. E. Shannon and W. Weaver, Mathematical Theory of 
Communication (University of Illinois Press, Urbana, 1949), p. 
67, Th. 17; Meyer-Leifer and W. F. Schreiber, Advances in Elec- 
tronics (Academic Press, Inc., New York, 1951), vol. III, p. 306. 

6 J. L. Lawson and G. E. Uhlenbeck, Threshold Signals M.L.T. 
Radiation Laboratory Series, 24 (McGraw-Hill Book Company, 
Inc., New York, 1950), p. 66, first line. 
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second.” Whatever coding system we use, the channel 
cannot transmit more than C bits of information per 
second. 

Instead of A we shall use in formula (4) the Boltz- 
mann constant & and measure the information in 
thermodynamical units. 


3. THE THERMODYNAMICS OF A CHANNEL 
WITH NOISE 


The cable is initially at a temperature JT and obtains, 
under normal conditions, an entropy So. When we 
transmit the telegram, we use an average electric power 
P during a certain time 7, and no heat. The entropy of 
the cable is not modified at first, but if we do not re- 
ceive the message, and leave it on the cable (now short 
circuited at both ends), it will propagate back and forth 
and be progressively attenuated by Joule effect in the 
resistances. Finally the whole electric energy Pr is 
changed into heat. This final stage corresponds to an 
entropy S larger than the initial entropy So. 

We can explain this situation by saying that the trans- 
mission of signals initially introduced on the cable a 
negentropy , 
AV=S—So (5 ) 
which was later on dissipated and lost in the resistances. 
The negentropy AN is the quantity which we want to 
compute. Some additional specifications are required at 
this point. 

We may first assume a rather sirong coupling be- 
tween the signals and the physical system (the cable). 
This is Raymond’s assumption “that the heat capacity 
of the closed system is so large that the change in 
temperature with operation of the communication de- 
vice is negligible.” This leads to a final entropy 


Pr 
S=Sot+—; (6) 
T 
hence 
Pr P 
AN =—= kvy—tr (7) 
T 


n 


with the help of Eq. (3). 

The negentropy AN represents information flowing at 
the rate of AN /r per second, hence the channel capacity 
(per second) 


AN 4 
C,=—= kvy—. (8) 
T P,, 


Let us compare the result with Shannon’s formula (4). 
We see that both formulas agree only for very small 
values of the ratio P/P, when the logarithm can be 
expanded and the expansion reduced to its first term 
only. 

Shannon’s formula was obtained under special as- 
sumptions which actually do not correspond to the 
model just discussed. Shannon assumed a loss less cable, 
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with no ohmic resistance. This means no coupling 
between the electric currents flowing on the cable and 
the physical cable itself. The source of noise is supposed 
to be outside the cable and to act independently. 
There is no direct mechanism here to produce the final 
increase of entropy that was found in our previous 
example. We may however keep the essential idea of no 
coupling between signals and cable and assume a dis- 
sipation mechanism of a different nature. The increase 
in entropy might be due to signal distortion, or to 
irregular changes in the spacing of current pulses and 
in the intensity of the pulses. 

The initial state was an organized system of pulses 
carrying the desired information, and the final state is a 


_ disorganized and disordered succession of pulses, 


distributed at random, and carrying no intelligible 
information. Our system has oscillating degrees of 
freedom’ and a random distribution of energy E, corres- 
ponds to a temperature T. 


n=VyT 


9 
E,=nkT. °) 


If we now assume the random energy £, to be pro- 
gressively increased by steps dE,, the entropy increase 
E2 dE, 


is 
Ez: dF, E2 
s-Si= f —=nk —=nkin—. (10) 
mn OT Ei £E, E; 


The initial state Z, corresponds to noise power P, only, 
(Eq. 3) running for a time r. The final energy level E; is 
obtained after all the additional energy Pr has been 
distributed at random between the oscillators, and 
added to noise power 


E\=Pat E.=(PatP)r; (11) 


hence the negentropy 
P,t+P 
AN=S—S)=nk ln . 


n 





This yields a channel capacity (per unit time) 


AN 28 P P 
Cy=—=k- in( 14+—) = br in( 1+—) (12) 


T 7 n 


which is identical with Shannon’s formula (4). The 
difference between our previous result (8) and the new 
one (12) comes from the fact that our original model 
maintained a constant temperature 7, while the new 
model exhibits a raise in temperature proportional to 
the raise in random energy E, with Eq. (9) valid at all 
times. This raise in temperature decreases the entropy 
difference (10) and results in a lower capacity for the 
channel. 

7It is a well-known result of signal analysis that a system of 


signals with frequencies y <vy and a duration r has m degrees of 
freedom. We can check it by comparison with Eq. (3) 


nkT = E,= P,t= kTvyr. 
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The logarithm appears here is a similar fashion to the 
well-known InV term in the entropy of an ideal gas. 

In conclusion we must emphasize the physical mean- 
ing of this discussion: Shannon’s formula really repre- 
sents a limit that cannot be overcome since it corres- 
ponds to optimum conditions with maximum efficiency 
unity in the transformation of negentropy into informa- 
tion. This results from the fact that the computations 
based on information (Shannon) or on negentropy, give 
identical results, provided all quantities are measured 
in the same unit system. 

If information is measured in bits and entropy in 
thermodynamical units, one finds that each bit corres- 
ponds to k]ln2~10~"* in thermodynamical units, since 
this is the ratio of units. This proves again that a bit of 
information can never be obtained for less than & In2 in 
entropy cost, as was proved in a previous paper.’ 


4. THE TRANSMISSION OF INFORMATION, 
LIVE INFORMATION 


The problem discussed in Sec. 3 is restricted to the 
simplest case of a communication from person to person, 
either by telegraph or by some telephonic device. This is 
a well defined and very important problem, the results 
of which can be extended to a variety of similar situa- 
tions; a person directly talking to another person, for 
instance. 

The theory however is not necessarily restricted to 
this special case and can be extended to much more 
general problems, provided special attention is paid to 
the conditions of operation. We intend to discuss these 
general problems, and to show that our negentropy 
principle of information enables us to investigate all 
possible circumstances of communication of information 
and to draw some important conclusions about their 
fundamental laws. 

The first interesting problem is broadcasting. Here we 
have one person talking to a whole audience: A—One 
speaker talking before a microphone connected to V 
cables with N different receivers. B—One given tele- 
gram sent to N different stations. C—A speaker in front 
of an audience in an auditorium. D—Radio broadcasts. 
In cases A and B we obviously have examples of our 
previous “person to person” communication with V 
channels operated in parallel. The transmitter must 
provide the power required for all the channels. If the 
channels are identical, we simply need N times more 
power for the transmission. The cases C and D are less 
favorable, because of the large amount of power spread 
around and not utilized; sound absorbed in walls, 
furniture and other sound absorbing materials, in case 
C, radio waves lost and dissipated in space, in case D. 
Except for these very large additional losses, these last 
cases are similar to the first ones A, B. Altogether, we 
may reduce these problems to examples of single 
channels operated in parallel. All these problems have 
some common characteristics; power and negentropy 
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are propagated together with the information. The 
receiver absorbs the energy and the negentropy in the 
same operation by which information is received. We 
shall characterize this situation by the expression ‘ive 
information.” The information is transmitted with the 
energy required for its detection. In the process of 
transmission, some energy is degraded, some negentropy 
is lost and information is lost by the same mechanism. 
The generalized Carnot principle states that negen- 
tropy must always decrease, and information follows the 
same law, in all these examples of live information. 


5. THE PROBLEM OF WRITING AND READING 


We find a completely different situation when we 
examine another type of transmission of information: 
writing and reading. There is obviously a certain ex- 
pense of energy in the process of writing, but we do not 
find any trace of this energy on the written paper. There 
is no visible negentropy left either, and the information 
is there, completely dissociated from these other ele- 
ments. This is not an isolated example, since a similar 
situation is found in all cases of recorded information: 
printed matter, phonograph records, magnetic tape, 
punched tape and punched cards, blueprints, etc. In 
these examples we have information stored and no 
energy nor negentropy connected with it. Furthermore, 
we can read out the information without destroying it. 
This seems to account for an unlimited possibility of 
multiplying information. 

We had a principle by which information and 
negentropy were always decreasing. Here we have in- 
formation without negentropy, and an apparent possi- 
bility for increasing indefinitely the information. 

Let us specify the problem on a typical example. An 
author writes a book and the book is published. We do 
not intend to discuss the problem of thinking and the 
work of the author. This is the human element which we 
have been very cautious never to include in our discus- 
sions. It does not make any difference whether we 
consider a book written by an author, or a mere collec- 
tion of numerical data: stock exchange quotations, 
weather observations, etc. If the manuscript is NV letters 
long, it contains a certain amount of information. Our 
definitions do not go beyond that point. 

The book is printed to 1000 copies, and each copy is 
read by 100 persons. The total amount of information 
has been multiplied by a factor 10°. Instead of decreas- 
ing, information has been increasing, and we cannot see 
any limit to this increase. 

Closer observation reveals that we shall have the 
usual causes for dissipation: misprints, errors of trans- 
mission, lack of understanding, etc. The actual factor 
will be less than 10°, but it can still be very large. 

The problem requires a closer examination in order to 
show how the negentropy principle of information can 
be applied to these situations. 
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6. DEAD INFORMATION AND HOW TO 
BRING IT BACK TO LIFE 

We characterize the preceding situations as cases of 
stored dead information, with no energy and no negen- 
tropy connected with it. The point is that in order to 
read out the information we must re-energize it. An 
additional source of energy is absolutely necessary for 
the reading, and this source of energy provides the 
negentropy which is changed into new information. 

We need a source of light to read a book. The phono- 
graph does not work without a motor to turn the records. 
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A blind person reading a book printed in the Braille 
alphabet has to move his finger on the paper and this 
motion requires work. We need light and photocells to 
read punched tape, or we may use a battery and 
brushes passing current through the holes of a punched 
card. In all these examples, the amount of negentropy 
taken from the external source is larger than the in- 
formation obtained. The case of reading a punched tape 
is a typical example, on which we can compute both 
entropy and information, and prove the relation be- 
tween them. 

Information is recorded on punched tape in binary 
digits (Fig. 1). The zeros are represented by holes and 
the ones by unperforated tape sections. Each section of 
the tape contains one bit of information, or 


Al=kIln2_ per section (13) 


in thermodynamical units, when & is Boltzmann’s 
constant. Our reading device (F’g. 2) uses a source of 


— oa — 


Beam of light 


Punched tape 
Fic. 2. Reading device. 
light, with a beam going through the hole and received 


in a photoelectric cell. Each section requires at least 
one quantum hy that is finally absorbed: 


either in the photoelectric cell, if there is a hole, or in 
the paper tape, if there is no hole. 


In order to be distinguished from the background of 
thermal radiation, the quantum hy must be high enough 
hv>kT \n2 (14) 


as was proved in a preceding paper.' When the quantum 
is absorbed, there is a loss in negentropy (an increase 
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AS in entropy). 
hy 
e~—ae (15) 


and we have the relation 

A(S—I)20 (16) 
which shows that our generalized Carnot principle is 
satisfied. In this simple case we prove that the informa- 
tion comes from the negentropy of the light beam, and 
we can theoretically reach the limit corresponding to an 
efficiency unity. The problem escaped notice because of 
the very small quantity of negentropy required 


k ln2~10~'* 


The preceding type of punching is not too reliable, 
since the ‘“‘one’s” are not really recorded. A plain piece 
of tape, with no holes would be read as a string of 
“ones,” which is wrong. A safer method would be to use 
two different types of holes, on two different levels 
(Fig. 3) punched in a reflecting metallic tape. This would 
require just as much energy for the reading operation. 
hence a AS just as large as in the preceding device. 


in cgs céntigrade degrees. (17) 
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We may use a system of brushes and a battery passing 
a current through the holes. The current is read on an 
ammeter or recorded in a relay. According to our dis- 
cussion in the preceding paper,' the limit in this case is 
again k |n2 per reading. In a mechanical device, like a 
phonograph record, small positive or negative pulses 
can be read out only if the mechanical system yields an 
energy larger than kT In2 per pulse, hence an entropy 
increase k In2. Similar remarks apply to pulses recorded 
on magnetic tape. In every case, the negentropy lost in 
the reading operation is larger than the information 
obtained and the generalized Carnot principle is always 
satisfied. 

Similar remarks apply to the problem of information 
recorded in blueprints, that can be used to build special 
machines. Each time we build a machine we must read 
out the blueprints. The Reading operation absorbs 
negentropy, part of which is changed into information 
imbedded in the machine built according to blueprints. 


7. WRITING AND PRINTING 


We can now consider another problem: how much 
negentropy is spent in the process of writing? This 
negentropy is lost, but we obtain a new copy of the 
information, hence the printing mechanism is another 
example of the transformation of negentropy into 
information. 

Photographic printing is the simplest example to 
discuss: we put the punched tape on a photographic 
paper and send a beam of light on each position. Each 
time, we have to use one quantum hy or more, in order 
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to print the position of a hole. The situation is similar to 
the case of Sec. 6 and leads to similar conclusions: 
printing a new copy costs exactly as much as reading it. 
The new information is paid for in negentropy. 

In all these discussions, we ignored typical irrever- 
sible processes: developing the photographic paper, or 
amplifying the electric impulses, etc. All these refine- 
ments of the technique cost a great deal in entropy, and 
the total cost of a reading or printing operation will be 
very much higher than the limit we computed. 

The essential point is: every information is paid for 
in negentropy. The amount of negentropy required is 
finite, but it is so small (10~'* per bit) that it has been 
completely ignored up to now. The connection between 
entropy and information is absolutely needed for con- 
sistency. The problems discussed in this paper would 
appear helpless otherwise. The very small value 10~'* 
of entropy required per bit of information plays an 
essential role in modern life, and makes it possible to 
communicate information at a negligible cost. 

Another important remark is the law of decay: any 
system on which information has been recorded will 
deteriorate in the long run. Furthermore the information 
itself may progressively lose its value. For both reasons, 
recorded information follows a general law of decay, 
which is consistent with the generalized Carnot prin- 
ciple. The decay will, however, be much slower for dead 
(recorded) information than for live information. 


8. INFORMATION AND ORGANIZATION 


We have been able to analyze the information content 
of a written sentence, of a set of numbers, and of many 
other situations where our statistical definition of in- 
formation could be directly applied.’ In a similar way, 
we may compute the information contained in a blue- 
print, provided this blueprint can be explained with a 
finite number of symbols. Let us consider, for instance, 
the wiring diagram of some electric device; we may 
number from 0 to WN all the terminals appearing on the 
diagram and explain the diagram by giving the numbers 
of the terminals connected together: a bracket (1023- 
0216) simply means that we must connect terminal 
1023 to terminal 0216. For a finite number of terminals 
there is a finite number P of possible connections, out of 
which the wiring diagram selects one special system. It 
therefore contains an information (see paper 1) 


I=K InP. (18) 


Let us again emphasize the exact meaning of such a 
definition: it does not make any difference between a 
wiring diagram corresponding to a working model and 
another diagram that would yield a worthless structure 
with shortcircuits or other defects. The information we 
compute is just the information given to the craftsman 
who is going to do the wiring, and has no specific idea 
of what the machine is going to do. 

The question why this diagram has been selected and 


§ L. Brillouin, Am. Scientist 38, 596 (1950); J. Rothstein, Inst. 
Radio Engrs. conference (March, 1952). 
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how it works is not considered. This was the problem of 
the scientist or the engineer who invented the device. 
It required thinking and scientific discussion, all brain 
operations which are beyond the scope of our present 
methods of investigation. 

Let us take a specific example: the electric device 
contains m circuit elements, each of them with m 
terminals, and g terminals for outside connections (in- 
cluding ground or open connection). Altogether we have 


N=nm+ q (19) 


terminals, each of which can be connected to any other 
one, giving 
P=NN 


different possibilities. The information contained in one 
wiring diagram is 


I=K |lnP=KN InN. (20) 
Taking K=log2e~ 1.4, we obtain the information in bits 
IT=N logeN. 


Assuming reasonable orders of magnitude we consider 
the case 
n=999 m=10 g=10 
hence 
N=10 000+ 284 


I= 133 000 bits. (21) 


This is a large figure in bits, but if we compute it in 
entropy units, we must multiply by 10~'® which is 
approximately the ratio of units and we obtain a 
negentropy 

—S=1.33 10- (22) 


which is completely negligible. 

The machine built according to our specifications 
contains such a structural negentropy which represents 
the information or organization of the machine. This 
negative term however is completely negligible when 
compared with the total entropy of the machine. This 
example exhibits some characteristic features of the 
whole theory. 

The negentropy principle of information requires that 
every piece of information be connected with a corres- 
ponding negative term in the physical entropy of a 
system. This connection is absolutely needed for con- 
sistency and was proved by the discussions of the 
preceding paper.’ 

Practically, these negative terms can be ignored in 
most problems because of the small coefficient (10~'*) 
introduced by the change from binary digits (bits) to 
thermodynamical units. The smallness of these terms is 
the fundamental reason why transmission of informa- 
tion by any practical method: writing, printing, tele- 
communication, is so inexpensive in entropy units, 
which also means inexpensive in dollar units. Modern 
life is based on these facts and would be completely 
different in a world where the negentropy of information 
would have a larger value. 
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The electrolytic analog transistor is an operating model of the junction transistor which substitutes the 
reduced and oxidized forms of ions in solution for electrons and holes in a semiconductor. A base electrode 
makes a low-resistance contact to the solution and also serves to maintain the ratio of oxidized and reduced 
ions at an equilibrium value, thus establishing the potential of the solution. In P-N-P operation, a polarizable 
metal electrode (the emitter) is held at such a potential above the base electrode that the majority ions, the 
reduced form, are readily oxidized at its surface. An identical electrode, the collector, is placed within a few 
tenths of a millimeter of the emitter. It is biased in such a way that the minority ions (the oxidized form) 
which diffuse from the emitter to its surface are reduced. Utilizing several oxidation-reduction couples and 
different base electrodes, it has been found that values of a= —(0/./d/,.)y, in the neighborhood of unity 
can be obtained. Because of the relatively low mobilities of ions in solution, the device operates only at 
low frequencies. The device may also have some application as a constant current element in transistor bias 


circuits. 


I. INTRODUCTION 


HE operation of both point-contact! and junction? 

transistors depends on current flow in semi- 
conductors when carriers of two signs, electrons and 
holes, are present. For example, in a P-N-P junction 
transistor, holes injected at one P-N junction, the 
emitter, diffuse across the narrow N region to the 
collector junction. The magnitude of the current 
depends upon the voltage difference between the 
emitter and a base electrode which makes a low-resist- 
ance contact to the N region. When the collector is 
biased in the reverse direction, most of the current 
flowing to the collector comes from the emitter and the 
current is nearly independent of collector voltage. 

We have constructed an electrolytic analog of the 
junction transistor in which the flow of ions in solution 
replaces the flow of electrons and holes in the semi- 
conductor. An oxidation-reduction couple forms a 
suitable system. 

An ion in a reduced state in a solution which is 
normally oxidized (consists largely of ions in the 
oxidized state under the chosen conditions) corresponds 
to an excess electron. An ion in an oxidized state in a 
normally reduced solution corresponds to an electron 
defect or hole. In the analog of the P-N-P transistor 
the electrolyte is normally in the reduced state. One 
electrode, the analog of the base, makes a low-resistance 
contact to the solution. The emitter and collector of 
the analog are similar highly polarizable electrodes 
such that little current flows unless the voltage differ- 
ence between one of the electrodes (the emitter) and 
the base exceeds a critical value (the decomposition 
potential) for oxidation to occur. The collector is 
maintained at a lower voltage than the emitter and is 
placed very close to it. Ions oxidized at the emitter 
diffuse to the collector where the reverse reaction occurs 
with the return of the ions to the reduced state. If the 

* Supported in part by the U. S. Office of Naval Research. 


1 J. Bardeen and W. H. Brattain, Phys. Rev. 74, 230 (1948). 
* W. Shockley, Bell System Tech. J. 28, 435 (1949). 


solution is normally in the oxidized state, one has the 
analog of the \-P-N transistor. The polarities are 
reversed and the reaction occuring at the emitter is 
a reduction. 

If the conductivity of the electrolyte is sufficiently 
high, the electric fields in solution are small and the 
ions move mainly by diffusion. Inert electrolytes may 
be added to increase conductivity. 

The frequency response of a transistor is limited by 
the time taken for carriers to diffuse from the emitter 
to the collector. In germanium transistors, transit 
times are of the order of 10~’ second. Since the mo- 
bilities of electrons and holes in germanium are at 
least 10’ times larger than those of ions in solution, 
the transit time in the electrolytic transistor is one 
second or longer. Transient phenomena which occur 
in the megacycle range in ordinary transistors occur in 
the analog on a greatly expanded time scale and may 
thus be studied more readily. 

It is believed that the analog may be useful in 
studying flow problems in transistor structures. The 
geometrical configuration of electrodes in solution 
can be changed readily to test a variety of structures. 
It is not yet certain whether the analog can be made 
sufficiently close so that it can be used in a quantitative 
as well as a qualitative manner. 

It is also believed that the electrolytic transistor may 
have useful applications of its own for current control 
devices for which ordinary transistors are not suitable. 


Il. DESIGN OF THE CELL 


In order that transistor action be observed, it is 
necessary that ionic or molecular species produced 
at one electrode control] the current at the other, either 
through specific participation in the electrode process 
or else in the rate-determining step. To assure that such 
action occur in a reasonable time, and also to avoid 
losses through convection or off-axis diffusion, the 
emitter and collector must be quite close to one another. 
In the arrangement which we have used, the emitter 
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and collector are circular platinum electrodes 0.5 in. 
in diameter, one being moved by a micrometer screw 
arrangement and the other being fixed. The reverse 
sides of these electrodes are insulated from the solution. 

The base electrode must serve to establish the 
potential of the solution. The most suitable electrode 
for this purpose is a relatively unpolarizable one which 
interacts reversibly with one or more of the ionic 
species in solution. For example, a mercury pool in 
contact with mercurous chloride and chloride ion is a 
satisfactory electrode. The base electrode serves to 
maintain the desired equilibrium between oxidized and 
reduced species. 

The cell itself is an open framework constructed of 
lucite. All joints which come into contact with the 
solution as well as all submerged metallic parts, except 
for the emitter and collector faces, are thoroughly 
coated with a suitable cement. As is shown in Fig. 1, 
the electrical leads are insulated from the solution by 
being inserted through the cell framework. A piston- 
cylinder arrangement affords rather high orientational 
stability of the movable electrode with respect to the 
fixed one. 


Ill. EXPERIMENTAL 


The circuitry involved in the experimental arrange- 
ment was quite standard. Storage batteries were 
connected through potentiometer rheostats to the 
emitter and base and collector and base, respectively. 
Voltmeters and ammeters were connected in the usual 
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for the P-N-P electrolytic transistor. 
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>. 3. Rectification characteristic for the 
chloride-chlorine couple. 


way. This, along with the current and voltage con- 
vention for P-N-P operation, is shown in Fig. 2. 

The oxidation-reduction couples utilized in the 
present research had an important property in common, 
namely, that in contact with the base electrode the 
equilibrium concentration ratios of the two species 
yielded large numbers. It is easily shown that in one 
of the solutions used the equilibrium ferrous ion 
concentration is greater than that of ferric ion by a 
factor of 10’. ‘ 

In general, the techniques used were simple. A 
solution of the desired components was prepared and 
allowed to stand over the base electrode in a beaker. 
The cell was then placed into the beaker with the 
desired gross setting of the electrode spacing. No 
measurements were taken until stirring subsided. It 
was found that the device quickly recovered from 
ordinary vibrations in the laboratory. If the electrodes 
were spaced less than a few tenths of a millimeter 
apart, vibrations produced no observable effects. This 
spacing also eliminated interference which could result 
from thermal or density gradients in the solution by 
assuring that extraneous flow between the electrodes 
was essentially laminar and, if not of short duration, 
at least uniform. 
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TABLE I. Emitter and collector characteristics of 
the electrolytic transistor. 








(Chloride-Chlorine Couple) 





Ie (ma) Ve (v) Ie (ma) Ve (v) 
0.154 1.00 —0.123 0.00 
0.482 1.05 —0.418 0.00 
2.42 1.10 —2.17 0.00 
5.17 1.15 — 4.80 0.00 
7.32 1.18 — 6.89 0.00 
9.09 1.23 — 8.62 0.00 
0.0467 1.00 — 0.0836 0.30 
0.389 1.05 — 0.337 0.30 
2.21 1.10 —1.97 0.30 
5.33 1.15 —5.02 0.30 
7.43 1.20 —7.03 0.30 
8.64 1.25 — 8.23 0.30 








A. Chloride-Chlorine Couple 


A solution 1M with respect to NaCl and 0.2M with 
respect to HCI was prepared in distilled water. In two 
electrode operation, i.e., with the collector disconnected, 
the rectification curve shown in Fig. 3 was obtained. 
The base electrode was a mercury, mercurous chloride 
pool. , 

In transistor operation, V. (collector voltage) was 
held constant while V, (emitter voltage) was increased. 
From the data in Table I, it can be seen that the emitter 
was operated in the sharply rising portion of the 
voltage-current curve shown in Fig. 3. As V, increased, 
I, (emitter current) increased. This resulted in an 
increase in J, (collector current). The emitter current 
is plotted against the collector current in Fig. 4. The 
slope of this curve is a= (—0/,/dI.)y,. This is essen- 
tially a measure of the efficiency of the modulation 
of the current in the collector loop by the emitter. 
With this particular value of electrode spacing, a= 0.96. 
This corresponds to collection of 96 percent of the 
chlorine produced at the emitter. Since all of the data 
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Fic. 4. Electrolytic transistor current characteristics 
for the chloride-chlorine couple. 
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fall upon one curve, it is seen that a is essentially 
independent of the collector voltage in the range 
studied. 

If the process under investigation is diffusion limited, 
one would suspect that the modulation as reflected in 
a is a function of the emitter-collector spacing d. The 
parameter d was varied between 0.1 and 1.2 mm. 
These data are presented graphically in Fig. 5. It will 
be noted that a does decrease with increasing d. 
Figure 6 is a graph of d versus a. 

The chloride-chlorine couple is more stable than that 
attributable to the oxy-acids of chlorine except for the 
chloride-perchlorate couple. ‘The latter couple is 
eliminated because it is essentially irreversible at room 
temperature. Further, one observes the presence of 
Cl, in the solution surrounding the emitter by its 
characteristic yellow-green color. 
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Fic. 5. Electrolytic transistor current characteristics for the 
chloride-chlorine couple as a function of emitter-collector spacing 


B. Ferrous-Ferric Couple 


A ferrous ion solution containing 0.25M FeSO, and 
1.5M HCl was investigated. The rectification charac- 
teristic curve shown in Fig. 7 indicates the appearance 
of the expected limiting current. The electrode separa- 
tion parameter d being about 0.08 mm, one expects 
a to be very nearly unity. It is shown graphically in 
Fig. 8 that this is the case. The corresponding data are 
given in Table II. 

The collector current plotted against the collector 
voltage for this solution is shown in Fig. 9. The resultant 
output characteristic indicates that the collector 
operates independently of its bias in the region from 
—0.25 v to +0.25 v. 


C. Bromide-Bromine Couple 


It was pointed out to us by Professor H. A. Laitinen 
that the highly reversible bromide-bromine couple 
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Fic. 6. a as a function of emitter-collector spacing for 
the chloride-chlorine couple. 


might be satisfactory for use in this apparatus. This 
was attempted with the use of two different base 
electrodes, mercury, mercurous bromide and silver, 
silver bromide. 


(1) Mercury, Mercurous Bromide Electrode 


The rectification characteristic obtained using a 
solution of 1M NaBr was identical to that shown in 
Fig. 3 except for the fact that it was not quite so flat in 
the low-voltage region and the break occurred near 
0.8 v. 

It is well known that bromine decomposes water in 
solutions of low acidity. In the present case, this would 
result in an increase in a with a decrease in pH. This 
was investigated in four solutions of pH 7, 4, 2.5, and 
1 with the volume and bromide ion concentration 
remaining constant. Decrease in pH was brought about 
through the addition of HBr. It was found that the 
value of a was constant at 0.98. Emitter and collector 
characteristics for pH 7 and pH 1 are reproduced in 
Table ITI. 

It is not improbable that in a slightly basic solution 
this property could be utilized to simulate the phe- 
nomenon of recombination of injected carriers in the 
base region of a junction transistor. 
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Fic. 7. Rectification characteristic for 
the ferrous-ferric couple. 
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Fic. 8. Electrolytic transistor current characteristic 
for the ferrous-ferric couple. 


(2) Silver, Silver Bromide Base Electrode 


A slight modification of the base electrode contact 
was made for this experiment. The rectification charac- 
teristic using a solution 1M with respect to NaBr and 
0.1M with respect to HBr was extremely flat from 0.0 


TABLE II. Emitter and collector characteristics of 
the electrolytic transistor. 








(Ferrous-Ferric Couple) 





Ie (ma) Ve (v) Ie (ma) Ve (v) 
1.30 0.32 — 1.37 0.00 
4.05 0.40 — 4.00 0.00 
8.25 0.43 — 8.12 0.00 

10.0 0.44 — 9.86 0.00 
15.2 0.46 — 15.5 0.00 
20.8 0.50 —21.0 0.00 
30.6 0.57 — 30.3 0.00 








to 0.9 v. Although no gas evolution was observed until 
the applied voltage was in the neighborhood of 2 v, 
there is a suggestion of an inflection point in the rectifi- 
cation curve at 1.1 v. This, if it is real, would indicate 
that a process other than the oxidation of bromide to 
bromine occurs near this voltage. 
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Fic. 9. Electrolytic transistor output characteristic 
for the ferrous-ferric couple. 
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TABLE III. Emitter and collector characteristics of 
bromide-bromine couple as a function of pH. 











I. (ma) Ve (v) I< (ma) Ve (v) 
pH7 0.506 0.89 — 0.480 0.25 
1.41 0.91 — 1.32 0.25 
2.67 0.93 —2.57 0.25 
5.16 0.96 — 5.05 0.25 
7.63 0.98 —7.52 0.25 
9.62 1.00 —9.50 0.25 
pH 1 0.376 0.89 —0.363 0.25 
0.951 0.92 —0.920 0.25 
2.24 0.92; —2.12 0.25 
4.84 0.95 — 4.67 0.25 
6.87 0.97 —6.70 0.25 
9.33 0.99 —9.10 0.25 


In operation as a transistor, this solution yielded 
a=0.92 with a spacing of about 0.04 mm. The output 
characteristic, shown graphically in Fig. 10, was 
exceptionally flat. 


IV. THEORY 


The conductivities of the solutions used are so high 
that the flow of ions between the emitter and collector 
is mainly by diffusion. Electric fields and consequently 
potential differences in the body of the electrolyte are 
small. There will, however, be space-charge double 
layers extending outward a very small distance from 
the electrodes. When voltages are applied to the 
electrodes, part of the voltage drop may be across the 
double layers and part may appear through the body 
of the solution as a concentration difference. 

Let us first consider the diffusion of ions between 
the emitter and collector when the spacing is such that 
a is nearly equal to unity. Let c, and c; be the concen- 
tration of the majority species at the emitter and 
collector, respectively, and let cz be the concentration 
at the emitter of the (minority) species of ion produced 
there. These concentrations refer to positions just 
outside of the double layers at the electrodes. We shall 
assume that the collector is biased in such a way that 
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Fic. 10. Electrolytic transistor output characteristic for the 
bromide-bromine couple and silver-silver bromide base electrode. 
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the concentration of minority ions at the collector is 
practically zero. The variation of the concentrations 
between the emitter and collector is then as indicated 
in Fig. 11. In the ferrous-ferric couple c; and c; refer to 
the ferrous ion and c; to the ferric ion. 

‘ Since under steady-state conditions the particle 
currents of the two species of ions must be equal, the 
current density may be expressed as 


T= eD,(c3—¢;)/d=eD2c2/d, (1) 


where D,; and D, are the diffusion coéfficients and d 
the emitter-collector spacing. Equation (1) may be 
written é' 

bce=C3—C, (2) 


where b= D./D,. 

If it is assumed, as is likely the case under the 
conditions of our experiments, that the total concen- 
tration remains constant, we have 


$6o+3 (cy +03) = C0, (3) 
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Fic. 11. Electrolytic transistor concentration convention. 


where ¢io is the initial equilibrium concentration of 
the majority species. 

One further relation is required to determine how 
the concentrations vary with applied voltage. If it is 
assumed that the current is limited by diffusion, the 
voltage drop will occur in the body of the electrolyte 
and for a simple couple such as ferrous-ferric, 


C2/¢,=expLe(V —Vo)/kT], (4) 


where V is the emitter voltage and Vo is the standard 
oxidation potential of the cell which is computed by the 
algebraic addition of the standard electrode potentials 
of the operating couples. If cio and coo are the equi- 
librium concentrations of the two species 


C20/C10= exp(—eV0/kT), 


Eq. (4) may be written 
C2/€1= (€20/¢10) exp(eV /RT). (5) 
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Solution of Eqs. (2), (3), and (5) for c2 gives 
C2= Co explLe(V—Vo)/kT]/ 
{1+[(1+0)/2] exple(V—Vo)/kT]}. (6) 
If one defines 
Io= eDee,0/d, (7) 


the current density is 
I=Iy exple(V—Vo)kT]/ 
{1+[(1+-5)/2] exple(V—Vo)/kT]}. (8) 


For this case, in which the current is limited by 
diffusion, the current should be inversely proportional 
to the electrode spacing d. If, on the other hand, the 
current is limited by an electrode reaction, it will be 
essentially independent of d. Figure 12 is a plot of I 
versus 1/d for the chloride-chlorine couple. When 1/d 
is large (d small), the current approaches a constant 
value which indicates that in this limit the current is 
determined by an electrode reaction. The decrease in 
current with d for the larger spacings indicates that the 
current then is diffusion limited. The ferrous-ferric 
couple was observed to behave similarly. 

The theory for the chloride-chlorine couple is some- 
what more complicated than that given above because 
the chlorine is in solution in molecular form. The 
equilibrium concentration [Cls]o of Cle molecules in 
solution, determined from the reaction 


Hg+3Cl—3Hg2Ch., 


[Cle oh = exp(—eV0/RT), (9) 


where Vo is determined from the difference in standard 
oxidation potentials for the half-reactions: 


Clr-=3Clo+e (E°= — 1.3583 v)* 
Hg+Cl-=3Hg.Clo+e (E°=—0.2676 v)*. 
Thus, 
V 9= 1.3583—0.2676= 1.0907 v. (10) 


If the current is diffusion limited, 
[Cle ]¥em/LCl Jem = (LCl2]o#/LCl- Jo) exp(eV/RT), (11) 
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Fic. 12. Emitter current as a function of 
emitter-collector spacing. 


3W. M. Latimer, Oxidation Potentials (Prentice-Hall, Inc., 
New York, 1938). 
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Fic. 13. Electrolytic transistor emitter voltage-log 
emitter current characteristic. 


where the subscript “em” refers to the concentration 
just outside the double layer at the emitter. Equation 
(11) replaces Eqs. (4) and (5). Equations (1) and (3) 
also require modification. Let 


C2= [Cle Jem; j= ita ® 


Equation (1) becomes 
I= eD,(c3—¢;)/d=2eDoc2/d, (12) 


since each Cl, molecule contains two atoms. Similarly, 
Eq. (3) is replaced by 


Cot} (C1 +C3) = Cro. (13) 


Equations (11), (12), and (13) must then be solved 
for the concentration and current density. One is led 
to a quadratic equation for c, whose solution will not 
be given explicitly. When V is small, c; is nearly equal 
to the equilibrium value cio, and cz (and thus J) varies 
as exp(2eV/kT). When V is very large, the current 
tends to saturate at a constant value. 

If the current is limited by an electrode reaction 
involving the majority species, one expects 


I~c, exp(BeV/kT), (14) 


where 6 is a numerical parameter which depends upon 
the nature of the reaction. This again leads to an 
exponential variation of current with voltage when V 
is small and saturation at a constant value when V is 
large. This behavior is shown for the chloride-chlorine 
case in Fig. 13. 
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V. DISCUSSION 


The behavior of the electrolytic analog transistor 
under the conditions described above is basically as 
predicted. This extension of the phenomenon of 
transistor action to the liquid state confirms the general 
applicability of the physical analysis of the semi- 
conductor transistor. 

Two types of electrochemical reactions have been 
studied. These are the conversion of an ion to an 
uncharged molecule and the change in valency of an 
ionic couple. Metal-metal ion couples could be selected 
in various combinations as could metal, metal salt 
electrodes in the presence of their anions. The operation 
of the cell would also be possible with such organic 
couples as quinone-hydroquinone under various pH 
conditions. Nonaqueous solvents might also be used. 

The thermodynamic basis of collector activity is 
easily understood by a consideration of the details of 
emitter action. The entire cell is initially in equilibrium 
with the establishment of a certain stoichiometric 
balance as dictated by the standard electrode potentials 
of the operating couples and the ambient temperature 
and pressure. Upon the ingertion of an emf in the 
emitter circuit, the emitter and base tend toward an- 
other stoichiometric balance. 

Consideration of a definite set of couples will facilitate 
the explanation of the action in the collector circuit. 
If the emitter and collector are reversible to the ferrous- 
ferric couple and the base is a calomel electrode (which 
is reversible to chloride ion), the introduction of an 
emf into the emitter circuit results in the production 
of ferric ions at the emitter and chloride ions at the base. 
Ferric ions which diffuse to the collector are reconverted 
to ferrous ions. Under steady-state conditions the cur- 
rent to the base electrode is small. Thus, the collector 
circuit operates to restore equilibrium even though no 
bias voltage is applied. If, in this case, a positive bias 
is applied, it should in no way affect the tendency of 
the collector to seek equilibrium unless it is of such 
magnitude as to be near or greater than the decomposi- 
tion voltage of the ferrous-ferric couple. 

An unexpected result of this investigation was the 
observation of the fact that the limiting current of 
several solutions as determined from rectification 
characteristics was enhanced when the device was 
placed in three electrode operation. The enhancement 
was a factor of about two for the chloride-chlorine 
couple and about eight for the ferrous-ferric couple. 

At these spacings, the current is limited by diffusion 
of the majority ion in solution (in these cases, the 
chloride and ferrous ions) to the emitter. The width of 
the diffusion region is generally determined by con- 
vection currents in the liquid. If the spacing between 
the emitter and collector is less than this width, the 
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diffusion path is decreased and the current enhanced. 
The concentration of the majority ion at the collector 
is determined by the potential of the collector relative 
to the solution and is maintained by the reverse reaction 
which occurs at the collector. 

The graph shown in Fig. 10 demonstrates the be- 
havior of the output characteristic under varying bias 
conditions. This property of the electrolytic analog 
transistor has led to a consideration of its utility as a 
constant current element in transistor biasing circuits. 
As was mentioned above, this device should not respond 
to alternating currents above one cycle or so. 

This model of the electrolytic transistor is limited in 
current gain to unity in grounded base operation just 
as is the junction transistor. Since the output impedance 
is much higher than the input impedance, power gain 
can be obtained at very low frequencies. 

Experiment indicates that the current is not strictly 
diffusion limited at small emitter-collector spacings. 
The rate of the reaction occurring at the electrode 
controls the process under these conditions. At larger 
distances, the steady-state process is controlled by 
diffusion. Analysis of several curves such as that of 
Fig. 13 indicates that the value of 8 in Eq. (14), 
which is obtained from the lower asymptote to the 
curves, is nearly unity for the halide-halogen couples. 
The value of this parameter is approximately 
1.5 if small quantities of cerous and ceric sulfates 
are present in the chloride-chlorine solution. Since it 
is probable that these ions in some way catalyze the 
rate-determining electrode process, this observation 
would indicate that the theoretical value of 8 is indeed 
not attained because of a slow step at the electrode. 
Though the lower asymptotic current was not attained 
in the case of the ferrous-ferric couple, extrapolation 
of the curve to a value of 8 equal to unity appeared 
to be reasonable. 

The apparatus used in these experiments included 
bright platinum electrodes. It is evident that the value 
found for 6 depends strongly upon the instantaneous 
condition of the electrode surfaces. 

Research is currently being carried on for the pur- 
pose of investigating the electrolytic transistor as a 
circuit element and to determine its behavior as a 
function of temperature. 
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The intrinsic diffusivities of individual components in several binary systems have been determined with 
vapor-solid type diffusion couples. In this type of couple a high vapor-pressure component is diffused from 
the vapor phase into a slab of lower vapor pressure initially containing inert markers on its surface. Diffu- 
sivities may then be calculated from marker movement and diffusion-penetration curve data. Equations 


necessary for analysis of such couples are developed. 


The alpha brass, copper-nickel, and silver-gold systems were investigated. Observations of polygonization 
during diffusion and dimensional changes normal to the diffusion direction were also made. Porosity forma- 
tion, which occurred only in the vapor-solid silver-gold couples, was investigated in detail and was com- 
pared to porosity formation in corresponding sandwich couples. The general use and limitations of such 


couples in diffusion studies are ‘discussed. 





I, INTRODUCTION 


IFFUSION phenomena in metal systems may be 

conveniently studied in certain cases by the use 
of vapor-solid couples in which one component is trans- 
ported to the specimen surface through the vapor 
phase. In this type of couple, inert markers are 
initially placed on the surface of a solid slab which is 
then exposed to the vapor of a high vapor pressure 
component at diffusion temperatures. In general, the 
high vapor pressure component diffuses more rapidly, 
and as a result of the rapid inward diffusion from the 
vapor the slab expands and the markers are pushed out- 
ward with the expanding specimen. At the same time 
the markers become buried because of the smaller 
diffusional flow of the lower vapor-pressure component 
in the opposite direction. 

A photomicrograph of the buried markers in a typical 
alpha brass couple of this type is given in Fig. 1, anda 
schematic diagram of the diffusion zone is shown in 
Fig. 2. Consideration of the diffusion flow of zinc into 
the specimen from the vapor and of copper in the oppo- 
site direction shows that the entire hatched area, includ- 
ing the cross-hatched area, is proportional to the amount 
of zinc diffused inward past the markers and that the 
smaller cross-hatched area is proportional to the amount 
of copper diffused outwards past the markers. The ratio 
of diffusivities of copper and zinc is given by the ratio 
of these two areas. In addition, the intrinsic diffusivities 
may be determined from marker movement and diffu- 
sion-penetration data. 

Such couples possess a number of practical advan- 
tages. No welding or plating is required, and single 
crystals may therefore be conveniently handled. Inert 
markers are easily attached to the specimen surface, 
and their movements can be determined with particular 


* This work was sponsored by the U. S. Atomic Energy Com- 
mission and was done under Contract AT-30-1GEN-367. 

1 R. W. Balluffi and B. H. Alexander, J. Metals 4, 1315 (1952). 
( ?R. W. Balluffi and B. H. Alexander, J. Appl. Phys. 23, 953 
1952). 

*R. W. Balluffi and B. H. Alexander, J. Appl. Phys. 23, 1237 
(1952). 


ease and accuracy, since no graphical integration or 
placement of reference foils are necessary.‘ In addition, 
porosity formation is avoided in a number of systems 
where it occurs in sandwich-type couples. The chief 
limitation of the method is that only components of 
widely differing vapor pressure may be used. In order 
to obtain absolute values of the intrinsic diffusion coeffi- 
cients the vapor pressure of the volatile constituent 
must also be high enough so that the concentration at 
the vapor-solid interface is maintained constant during 
diffusion. If this boundary condition is not met, ab- 
solute values of the diffusivities cannot be derived from 
the diffusion curves. However, the difference between 
intrinsic diffusivities may still be obtained from marker 
movement data. 


Il. MATHEMATICAL ANALYSIS 


A proper analytical treatment of vapor-solid diffusion 
couples must take into account both the variation of 
diffusivity with concentration and dimensional changes 
caused by the absorption of mass from the vapor phase. 

















Fic. 1. Inert markers buried in alpha brass couple 
diffused 35.25 hr at 890°C. 1000X. 


*L. C. C. da Silva and R. F. Mehl, Trans. Am. Inst. Mining 
Engrs. 191, 155 (1951). 
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Fic. 2. Schematic diagram of alpha brass vapor-solid ] 9 7 
couple containing inert markers. 


In order to obtain a mathematical analysis for the one 
dimensional case, it is assumed with Darken® that the 
diffusion process may be described by the use of 
individual diffusion coefficients for each element. If dif- 
fusion is considered as motion relative to inert markers 
then the total rate of transport of component 1 across 
any plane stationary with respect to the vapor-solid 
interface is 


ac, 
h=- PORT Ce), (1) 
x 
where 
ac, | 
—¢o(t)= >| —, (2) 
Ox eek, 


In the above, v is the velocity of mass flow due to 
possible inequalities in the intrinsic diffusion coefficients, 
and ¢ is the velocity of mass flow arising from absorption 
of matter from the vapor. 

Following Darken,® therefore, 


0 ac, OC, 
| + D-——cu(as)— Cold |=0, (3) 


Ox Ox Ox 


and on integration 
ac, dC» 
D,—+ D.—-— Co(x,t) -—Ce(t) =1. (4) 


Ox Ox 


For a semi-infinite solid, as x becomes very large, 
aC,/dx, AC2/dx, and v approach zero; therefore the 
constant of integration, J, = —C¢(t), and 


1 0c; 0C2 = 
v= _|p—+ D—| (5) 


Ox Ox 


5 L. S. Darken, Trans. Am. Inst. Mining Met. Engrs. 175, 184 
(1948). 
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Inserting the values for v and ¢ in Eq. (1), and expres- 
sing concentrations in terms of atom fractions 


ON, 9 ON, 
—— -| (Di\No+ Dwv)—| 
ot Ox Ox 


ON ON, 
+[>—] -——, (6) 
Ox z=0 


Trying for the solution V,= f(x/t!)= f(A) 


dN, df dN, 
—1\—= | (DiNeo+ V2D\)— 
dy dx dr 
dV, an, 
+| o— om, (7) 
dx A=0 dr 





Therefore, V,= f(A) is a solution when the origin is 
placed at the vapor-solid interface, providing the con- 
centration at this plane remains constant. When this 
condition is satisfied the term [D,(dN1/dX)],-0 is a 
constant ¢/2 and Eq. (7) may be written 


dN, d dN, 
— 1a+9—=—| (D\No+ Dsv)—| (8) 
dy dn. dy 


é 


By making use of the translation \’=A+ e¢ in Eq.(8), 
the differential equation may be put into the familiar 
form, 

aN, d IN, 


dl 
—1) -—| (D,\Not+ Div —| (9) 
dy’ dN’ dy’ 





where )’ is referred to an axis fixed at the undiffused end 
of the specimen. Inspection shows that 


D= D,Not+DM,, (10) 


where D is identical to the chemical diffusivity which is 
obtained from the usual Matano analysis of sandwich- 
type couples. The Matano analysis in its usual form, 
therefore, may be applied to the vapor-solid couple. 
The above relationships are valid for unidimensional 
diffusion, where dimensional changes normal to the 
direction of diffusion are negligible, no porosity is 
present, and lattice parameter changes are compensated 
for. Allowance is made for changes in lattice parameter 
by using a distance coordinate x in the diffusion 
direction, which is chosen so that equal increments of x 
correspond to equal numbers of atoms, and is defined by 


V, 
dx=—dy, (11) 
V 


where y is the experimentally measured coordinate, V 
is the local atomic volume, and V, is the atomic volume 


+ The writers are indebted to Mr. H. Fara of Sylvania Electric 
Products Inc. for the above proof. 
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of a reference alloy.* In the systems investigated, the 
lower vapor-pressure material was always selected as 
the reference alloy. 

An important consequence of the fact that N,;=f(A) 
satisfies the diffusion equation is that the concentration 
in the vicinity of the marker originally at the join (or 
originally at the surface of the vapor-solid couple) must 
be independent of time. This may be shown as followst 

The velocity of any marker referred to the nondiffused 
end is given by :5 


ON, 
on= (D.—D)( ) (12) 


Ox 





The diffusivities are functions of concentration alone 


«nd therefore 
Xm 
(D.-D)=¢(=). 
ti 


Hence, (13) 
dXm (=) ; (* 1 
0 fe arg ; aa. ae 
dt fi ose 


A general solution of Eq. 13 cannot be obtained since 
the functions f’ and g are not known. However, by 
substitution in Eq. (13) it is seen that x,= Ki} is a 
particular solution when K is chosen, so that 


K=2g(K)- f’(K). (14) 


Furthermore this is the particular solution for the 
marker initially at the join, since x=0 when ¢=0. Since 
the distance traveled by the marker and all composition 
levels varies as ¢?, and initially the composition at the 
join is indeterminate, there must be one composition 
level which remains associated with the marker as it 
moves away from the original interface. 

It follows from the preceding that the intrinsic 
diffusion coefficients determined from marker location 
refer to the particular composition at the marker, 
which remains constant during the experiment. 


Ill. EXPERIMENTAL 


Diffusion runs were made using the vapor-solid 
couples zinc-alpha brass, copper-nickel, and silver-gold. 
A major objective of the present work was to check the 
range of conditions under which the assumptions used 
in the preceding analysis are realized in vapor-solid 
diffusion couple experiments. The results for each 
system are presented separately below. 


a. Zinc-Alpha Brass 


Experiments designed to obtain the ratio of diffusi- 
vities of zinc and copper in alpha brass using vapor- 


6 C. Wagner, J. Metals 4, 91 (1952). 

t The discussion of this point by Darken (see reference 5) has 
been questioned by LeClaire (see reference 7), and it therefore 
seemed desirable to present a separate proof. 

7A. D. LeClaire, Discussion to reference 5, Trans Am. Inst. 
Mining Met. Engrs. 175, 194 (1948). 
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Fic. 3. Diffusion penetration curves of alpha brass 
couples diffused 9.25 and 32.25 hr at 890°C. 


solid couples have been previously described. The ratio 
of diffusivities was determined by finding the ratio of 
the number of zinc atoms and copper atoms which 
diffused past the markers. When the inert markers re- 
main at constant concentration, the ratio of diffusivities 
at the specific marker concentration is given by the ratio 
of the total number of atoms of each component diffused 
past the markers in opposite directions and, therefore, 


D,/D2=Q,/Q», (15) 


where Q and Qs», the total number of atoms of 1 and 2 
diffused past the markers, may be obtained by suitable 
graphical integration under the diffusion curve. 

Two 0.25 in. thick —1.25 in. diameter discs of OFHC 
copper were prepared, having a coarse grain size of 1 
mm. After inert particles of alumina were placed on the 
specimen surfaces the disks were sealed in evacuated 
quartz capsules along with a large zinc source of fine 29 
atomic percent zinc-alpha brass chips, and were an- 
nealed at 890°C for 9.25 and 35.25 hr, respectively. The 
penetration curves are given in Fig. 3. The surface con- 
centrations at both times were identical and were 
essentially the same as the composition of the chips 
after diffusion, indicating that approximate equilibrium 
was maintained between the chips and the surface. 
(The vapor pressure of zinc is about 50 mm Hg at this 
temperature.) 

The expansions of the specimens and the marker 
movements, which were obtained from the difference 
between the specimen expansion and the depth of 
burying of particles (see Fig. 1), varied parabolically 
with time as shown in Fig. 4. Such a variation is in 
accordance with the solution V,=/(A), which satisfies 
the differential equation for diffusion, providing the 
concentration at the vapor-solid interface is independent 
of time as in the present case. Satisfaction of the pre- 
scribed boundary conditions is also indicated by con- 
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Fic. 4. Surface movement and marker movement versus /? 
for alpha brass couples at 890°C. 


stancy of composition of the original interface (x=0 in 
Fig. 3), in conformity with the solution V,=/(A). 

Since the specimen expansion is solely the result of 
the gain in zinc, the total movement of the surface is 
quantitatively equal to the area under the diffusion 
curve. The position of the original interface was there- 
fore obtained by graphical integration under the diffu- 
sion curves and was found to agree closely with the 
position found from the measured specimen expansion 
in each case. Careful metallographic examination of the 
diffusion zone revealed no evidence of porosity for- 
mation. 

The chemical diffusivity was obtained as a function of 
concentration from the 35.25-hr diffusion curve by the 
Matano analysis and is shown in Fig. 5. These values 
agree fairly well with previous data obtained from sand- 
wich-type couples, recently summarized by da Silva and 
Mehl.‘ This agreement indicates that the effect of the 
small amount of porosity developed in copper-alpha 
brass sandwich-type couples is quite minor. 

The values of the intrinsic diffusivities of zinc and 
copper were then calculated from marker movement 
and chemical diffusivity data using Eq. (12) and (10) 
and from marker movement, and area ratios using Eq. 
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Fic. 5. D versus concentration for alpha brass couples at 890°C. 
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(12) and (15). Both methods yielded the same results 
which are given in Table I. These values refer to the 28 
atomic percent zinc composition at the markers. 


b. Copper-Nickel 


Fine alumina particles were placed on the surfaces of 
2 in. thick, 1-in. diameter high-purity nickel discs 
having a grain size of about 1 mm. These specimens 
were then exposed to copper vapor on both faces for 
various times in a continuously evacuated furnace at 
1060°C. The copper vapor was supplied from similar 
copper discs which were placed with faces parallel at a 
spacing of 2 mm. The vapor pressure of copper (2 10~ 
mm Hg) is about 105 that of nickel at this temperature, 
and the spacing between disks was several orders of 
magnitude less than the mean free path of the copper 
atoms in the vacua obtained. Chemical analysis of the 
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Fic. 6. Surface movement and marker movement versus 1} 
for copper-nickel couples at 1060°C. 


copper disks after diffusion proved that negligible 
nickel was lost from the nickel to the copper. 

For this system it was obvious from the color of the 
specimen surfaces that the composition at the vapor- 
solid interface changed over a considerable period of 
time. Marker movements were obtained by sectioning 
the couple and taking the difference between the speci- 
men expansion (surface movement) and depth of bury- 
ing of the inert markers. The metallographic examina- 
tion also revealed that no porosity formed anywhere in 
the diffusion zone. The marker movement did not vary 
parabolically with time as shown in Fig. 6. Hence it 
appeared that the boundary conditions were not closely 
enough satisfied for application of the Matano analysis, 
and values for D could not be determined from the 
diffusion penetration curves. 

Although the vapor-solid couple was not suitable for 
measuring D in the copper-nickel system, the difference 
of intrinsic diffusion coefficients (Dcu— Dy) was calcu- 
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TABLE I. Intrinsic diffusivities in various systems. 
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Evaporation 








rate of 
Diffusivity volatile 
difference Chemical component Evaporation rate 
: (Di —D2) diffusivity (D) Intrinsic diffusivities (moles/cm?/ (——) 
System Temp (°C) Composition (cm*/sec) (cm?/sec) (cm?/sec) sec) D 
Alpha brass 890 28.0 atomic 5.1X10-* 4.8X10-* Dzn.=6.3X10-* 1X10? 2X 105 
percent Zn Deu=1.2X10-* 
Cu—Ni 1060 83.0 atomic 2.0X 10-” SOXIS'= * Dou=3.7X10-” 4x10-* 80 
percent Cu Dyi=1.7X10-" 
Ag—Au 940 63.5 atomic 5.6X 10° 3.8X 10 Dag=7.3X10° 3X1077 80 
percent Ag Dau=1.7X10 








a See reference 8. 


lated from marker movement velocities obtained from 
Fig. 6, and the penetration curves of the couples 
diffused for 114 and 259 hours. The composition at the 
markers in both couples was 82 atomic percent copper. 
Using the marker velocity data at 114 and 259 hours 
obtained from Fig. 6 and values of the gradient obtained 
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Fic. 7. Diffusion penetration curve of silver-gold couple 
diffused 64 hr at 940°C. 


from the diffusion-penetration curves, (Dcu— Dwi) at 
83 percent copper was computed. The value of D at 
this composition was taken from the recent data of 
Thomas and Birchenall,* and the intrinsic diffusivities 
were calculated using Eqs. (10) and (12). All values are 
given in Table I. 


c. Silver-Gold 


High-purity gold disks 3 in. thick and ? in. in diam. 
were prepared with a coarse grain size of about 1 mm. 
Fine tungsten particles were placed on the surfaces of 
these specimens which were then exposed to silver vapor 
for various times in a continuously evacuated furnace 
at 940°C. The silver was supplied from pure silver 
disks which were placed in an arrangement similar to 
that already described for the copper-nickel system. 
The vapor pressure of silver (2X 10-* mm Hg) is about 
5X 10° that of gold at this temperature, and the spacing 
between disks was several orders of magnitude less than 


8D. E. Thomas and C. E. Birchenall, J. Metals 4, 867 (1952). 


the mean free path of the silver atoms in the vacua 
obtained. Chemical analysis of the silver disks after 
diffusion proved that negligible gold was transferred in 
the opposite direction. 

The diffusion penetration curve at 64 hr is shown in 
Fig. 7. The chemical analysis of slices containing less 
than 30 atomic percent silver was considered uncertain 
due to difficulties in separating silver and gold, and, 
therefore, this portion of the curve is shown dotted. 
The 90 percent silver surface concentration is some- 
what below the desired value of close to 100 percent 
silver. After 4 hr the surface concentration was found 
to be 80 percent silver and, apparently, the surface 
concentration slowly increased during the major portion 
of the 64-hr period. Hence, the solution V=f(x/t') 
cannot apply strictly, but the curve of marker move- 
ment versus square root of time, Fig. 8, nevertheless 
appears linear within experimental error. This observa- 
tion indicates that an observed apparent parabolic 
marker movement cannot be used alone as a sensitive 
criterion that the desired boundary conditions are 
strictly maintained. 

Metallographic examination of the diffusion zones 
also disclosed the progressive formation of porosity in 
the region between the specimen surface and the inert 
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Fic. 8. Surface movement and marker movement versus @ 
for silver-gold couples at 940°C. 
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Fic. 9(a). Diffusion zones in vapor-solid silver/gold couples at 
940°C. 200. Above: 4 hr. Middle: 12 hr. Below: 64 hr. th 
indicates position of Kirkendall interface.) 


markers which produced a deviation from parabolic 
surface movement at longer times. Since such porosity 
was found only in the silver-gold vapor-solid couples, 
a detailed study of its formation was carried out in 
both vapor-solid and sandwich couples. The sandwich 
couples containing inert tungsten particles at the 
original interface were made by hot welding together 
disks of polycrystalline silver and gold. Typical micro- 
structures are shown in Figs. 9(a) and 9(b). 

It is evident that porosity tends to form on the silver- 
rich side of the Kirkendall interface in both types of 
couple, and that the amount increases with time. After 
64 hr the total volume porosity between the markers, 
and the surface of the vapor-solid couples was found to 
be 8 percent by lineal analysis, and after 93 hr 13 percent 
was found. It is also apparent that the finely distributed 
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Fic. 9(b). Diffusion zones in sandwich type silver/gold couples 
at 940°C. 200X. Above: ~1 hr. Middle: 2 hr. Below: 10 hr. 
(Arrow indicates position of Kirkendall interface.) 


porosity formed early in the diffusion cycle next to the 
Kirkendall interface, is continuously eliminated. This 
latter observation agrees directly with the conclusions of 
Seith and Kottmann® who have made similar observa- 
tions in silver-gold sandwich couples. 

Even though the boundary conditions were not 
exactly obeyed, the Matano analysis was applied to 
the curve of Fig. 7. Porosity effects were still small in the 
64-hr specimen and minor corrections$ were made in 
Fig. 7 to account for the small amount of porosity be- 
tween the surface and the markers at this time. The 
resulting D values along with other values obtained by 
previous investigators, are given in Fig. 10. The agree- 

* W. Seith and A. Kottmann, Angew. Chem. 64, 379 (1952). 


§ For further details see R. W. Balluffi, discussion to reference 8, 
J. Metals 5, 726 (May, 1953). 
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ment of our approximate values with previous data is 
fairly good. It should be remarked here that the results 
of Ebert and Trommsdorf”® and Johnson" using in- 
cremental couples are probably more trustworthy than 
the values of Seith and Kottmann.’ The latter values 
were obtained by the Matano analysis and may be 
subject to significant porosity errors. Their less de- 
pendable values in the silver-rich range have been 
omitted. 

The difference between intrinsic diffusivities (Da, 
— Day) at 63.5 atomic percent silver was next calcu- 
lated from Eq. (12) using the flow velocity and gradient 
at the marker (see Table I). As previously indicated 
this method does not depend upon the validity of the 
solution N,=f(A) and therefore applies even when 
variations in the surface composition occur. Using the 
value of D at the marker composition obtained from 
Fig. 10, intrinsic diffusivities were calculated by use of 
Eqs. (10) and (12) and are presented in Table I. Values 
of Dag and Da, calculated from Eqs. (12) and (15), 
using the ratio of diffusivities obtained from the hatched 
areas in Fig. 7, were the same. 


d. Dimensional Changes Normal to the 
Diffusion Direction 


Direct measurements were made of dimensional 
changes normal to the diffusion direction in the copper- 
alpha brass and copper-nickel couples. In the alpha- 
brass system, measurement of the original and final 
diameters of the couple diffused at 890°C for 35.5 hr 
proved that normal expansion was less than 0.1 per- 
cent. Since the maximum percentage increase of the 
lattice spacing amounted to about 1.7 percent, this 
result proves that dimensional changes, even those 
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Fic. 10. D versus concentration for silver-gold at 940°C. 
See reference 12 for silver self-diffusion values. 





0 H. Ebert and G. Trommsdorf, Z. Electrochem. 54, 294 (1950). 
4W. A. Johnson, Trans. Am. Inst. Mining Met. Engrs. 147, 
331 (1942). 
a Lazarus, and Tomizuka, J. Appl. Phys. 23, 1032 
1952). 
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Fic. 11. Polygonization visible on surface of alpha brass couple 
diffused 35.25 hrs at 890°C. 200x. 


resulting from the increase in lattice parameter, are 
largely restricted to the diffusion direction. This obser- 
vation agrees with the result of da Silva and Mehl‘ who 
found no normal dimensional change in the brass side 
of a massive copper-alpha brass sandwich-type couple. 

Similar results were obtained for the copper-nickel 
couples. A linear expansion of less than 0.2 percent was 
found in the couple diffused at 1060°C for 259 hr, 
although the maximum increase of lattice spacing 
amounted to about 2.5 percent. Normal dimensional 
changes of the foregoing magnitudes would have neg- 
ligible effect upon any quantitative calculations and the 
original assumption of constant specimen cross section 
seems to be closely approximated. 


e. Polygonization in Diffusion Zone 


In a previous note,'* one of the writers has reported 
metallographic evidence of polygonization in copper- 
nickel diffusion couples. Upon observation of the surface 
of a vapor-solid copper-nickel couple directly after 
diffusion, characteristic grooves at the intersection of 
polygon boundaries with the surface were plainly evi- 
dent. Consequently, the final surfaces of the present 
couples were carefully examined. 

Figure 11 shows the surface of the 35.25-hr brass 
specimen revealing a typical polygonized structure. 
Several slip lines prove that the orientation difference 
between adjacent subgrains is quite small. No subgrain 
structure was visible on the surface of an identical 
copper disk heated in the absence of zinc vapor. 

Careful examination of the surfaces of the silver-gold 
specimens failed to disclose any visible subgrain bound- 
aries. However, a Laue back reflection x-ray photogram 
of the surface layer of the specimen diffused 64 hr at 
940°C revealed multiple spots, and evidently this speci- 
men was also polygonized. 


'3R. W. Balluffi, J. Appl. Phys. 23, 1407 (1952). 
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IV. SUMMARY AND CONCLUSIONS 


The present work indicates that absolute measure- 
ments of intrinsic diffusion coefficients may be made for 
binary systems using the present vapor-solid couple 
method whenever the vapor can be supplied to the 
specimen surface at a sufficient rate to raise the surface 
concentration close to the concentration of the vapor 
source within a very short time. When this condition is 
satisfied the proper boundary conditions are maintained 
and chemical diffusivities may be obtained from the 
diffusion curve using the Matano analysis. Intrinsic 
diffusion coefficients may then be calculated from the 
position of inert markers initially placed at the original 
interface. 

This condition is presumably met if the rate of de- 
livery of the volatile metal to the specimens surface is 
sufficiently high compared to its rate.of diffusion into 
the metal. The ratio of the evaporation rate to the 
diffusion coefficient may therefore be used as a very 
approximate empirical criterion for the applicability of 
this type of couple to various systems. In the present 
work the technique was successful for the diffusion of 
zinc into copper at 890°C. At this temperature the 
ratio is about 2X 10° (see Table I). The technique was 
doubtful for the diffusion of silver into gold at 940°C 
and copper into nickel at 1060°C where the ratios are 
both about 80. 

When the rate of material transport through the 
vapor phase is too low to maintain a constant surface 
concentration, vapor-solid couples may still be used to 
obtain an exact value for the difference of intrinsic 
diffusivities since this quantity can be calculated from 
the experimental data without any assumptions con- 
cerning the boundary conditions. If values of chemical 
diffusivity are known from other experiments, the in- 
trinsic diffusivities may then be calculated. 

The intrinsic diffusivities in the copper-alpha brass 
system obtained by the present method are probably 
more accurate than those derived from sandwich type 
couples. This is shown quite clearly by the negligible 
scatter of points representing marker movements in 
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Fig. 4 compared to the broad scatter encountered in 
the data of da Silva and Mehl.‘ The ratio of Deu to 
Dy {= 2.2 calculated from the present data is consider- 
ably greater than the approximate ratio of 1.3 obtained 
from sandwich type couples in the copper/nickel system 
by Thomas and Birchenall.® It is felt that the use of the 
vapor-solid couple to determine intrinsic diffusion coeffi- 
cients insures direct and accurate measurement of 
marker movements and avoids problems associated 
with placement of markers and reference foils and with 
graphical methods of determining the original interface.‘ 
Porosity formation is also avoided in some systems. 

Careful length measurements prove that expansion 
normal to the diffusion direction is negligible in the 
present specimens, and it appears that in massive 
diffusion couples such expansion is restrained by the 
undiffused bulk of the couple. This conclusion is borne 
out by the measurements of da Silva and Mehl‘ but is 
contradicted by Seith and Kottmann.’ However, there 
are some discrepancies in the measurements of Seith 
and Kottmann which are discussed elsewhere. It 
appears that, after correcting for changes in atomic 
volume, the assumption of a constant number of atoms 
per equal increment of distance is closely approximated 
in massive diffusion specimens. 

In agreement with the observations of Barnes! the 
present work indicates that polygonization in the diffu- 
sion zone is a frequently occurring phenomenon. Poly- 
gonization is probably the result of the mass flow 
associated with the unequal diffusion processes and 
may be intensified because the specimen expansion is 
not isotropic, as it tends to be, but is restricted to the 
direction of diffusion by the bulk of the couple. 
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4R. W. Balluffi and L. L. Seigle, Naturwiss. 20, 524 (1953). 
16 R. S. Barnes, Proc. Phys. Soc. (London) 65B, 512 (1952). 
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ed (Received May 11, 1953) 
m 
he The problem of finding the density of ac energy and the ac power flow in an electron beam, originally 
‘fhi- uniform in velocity and charge density, which interacts with a weak electromagnetic traveling wave, is 
of considered. For the problem to be significant, the way in which the traveling wave is built up must be speci- 
ed fied and the energy and power flow turn out to depend upon the method of establishment. 
th 
e.* 

N the discussion of traveling-wave tubes and similar average ac energy density and power flow in the beam 
on electronic devices, one often assumes for simplicity for large z and ¢. 
he that the electromagnetic fields have the space and time Before attempting to answer this question it is of 
ve variation of a simple traveling wave. It is the purpose of interest to note that the rather meaningless question 
he this note to discuss some considerations which must be asked originally may be answered plausibly by using 
ne taken into account if this picture is used. In small small signal theory uncritically. For, using the usual 
is signal theory the amplitude of the fields is supposed to complex notation of linear theory, one may write 
re be so small that the equations of motion and continuity 
ith may be linearized by the neglect of products of ac dp Opy 
It quantities. In such a theory the ac velocities and cur- es , 
nic rents in the beam and, hence, the admittance of the 
ms latter are readily found. It is of some interest to know avs 
ed the density of stored kinetic energy in such a beam and, —f-9o— = — eilottts), 

also, the flow of ac power. Since these are second-order Oz 


i. quantities in the applied ac field amplitude, their 
| evaluation would appear to require the analysis to be 


“a carried beyond the usual linear terms. For such terms 

will will contribute to the energy as the product of the 
ot second-order density term and the square of the zero- 
t. order velocity. 

ho It will be sufficient to consider the simplest problem 


of this type. Suppose that a beam of electrons, homo- 
geneous in velocity vp and charge density po, interacts 
with a weak traveling-wave electric field in the direction 
of its motion (along the positive z axis). Suppose that 
nd i sae ; 
ven all forces are longitudinal and of magnitude —\) cos(wt 
+Iz), where \ is small in the sense that it causes small 
changes in velocity and charge density. The question 
. is now asked, ‘‘What is the ac energy density and the 
: ac power flow?” Now it is clear that as thus formulated 
the question is incomplete. For the applied field extends 
indefinitely in space and in time, and no connection is 
established with the “original” homogeneous state of 
the beam. To make the problem meaningful and also 
relevant to the kind of application which is usually 
made of models of this kind, one may suppose that the 
field has been established in some gradual fashion. 
Specifically, the force on the electrons will be taken as 


—a(z,t) cos(wi+Tz), 


where a(z,t)—0 as z or /->— © ; a(z,l)—>1 as 2,t become 
large and da/dz, da/dt may be made arbitrarily small. 
Thus the field amplitude varies arbitrarily with posi- 
tion, but builds up very slowly to the amplitude X for 
all large z and ¢. One may now legitimately ask for the 





where p and v are the charge density and velocity in 
the beam. A will be given by eE/m, where e and m are 
the electron charge and mass and E is the electric field. 
Writing p=potpie’@*l) and v=a+ etl), and 
making the linear approximations, we have 


(w+ T'v)pit+ T’ pov =0 
(w+T'vo)v1=jr 


pr 
y= —_—_—— 
(w+T'v) 
—jT pod 
it (w+T'00)”” 


and for the rf current, 1;= pov1+-vop1, 
Jwpor 
————-, 
(w+Tv)? 


Suppose that the field E does not vary in the x direction 
and let the beam have a small thickness 6 in the y 
direction. The tangential magnetic fields, H,- and H+, 
on the —y and + sides of the beam satisfy 


jupodd 
(w+Tv0)? 
jopoi(e/m)E 
HT)? 


1 


H.=H,--— 


=a z* 
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The power entering the beam per unit area is then 


copab(e/m) 
eta INS 2 (1) 
me (lz I, ( 


The quantity wpod(e/m)/(w+T'v)* will be called the 
susceptance of the beam and written as B(w,I'). Thus 
the power absorbed per unit area will be 


} Re(jB@,T))| El? 


and will vanish for real w and I. 

Now suppose the ac energy storage per unit length 
in the beam to be 3K|£|*. Imagine that w is slightly 
complex, so that w=w:+jwe, while I is real. Then 


} Re[ E*(H.-— H+) |= 5 Re 





BlsT)=BlouT)+- jon 


Ow 


0B a) 


and 


0B 
RejB(w,T) = ~n(—) ; 
Ow ww) 


E however varies as exp(jwi—w2)t and |E|? as 
exp— 2wet, so that the rate of change of stored energy 
per unit length is 


0 
—3K | E|*= —wK| E|’, 
ot 


and this must equal the rate at which energy enters the 
beam. So we have 


oB 
osK|El=tu(—) |E|? 
Ow w=] 


1/0B 
K=-(—) (2) 
2 Ow a=] 


This is a result familiar in the ordinary analysis of 
reactive networks.* By an exactly similar argument 
keeping w real and making I slightly complex, we find 
that if }/| £|* is the ac power flow in the beam then 


1/0B 
ra), 
2\or/rer; 


Returning to the more definitely stated problem it 
will be convenient to suppose that the undisturbed beam 
contains, not a single velocity of electrons, but a dis- 
tribution of velocities. It will be supposed however 
that all electrons travel in the same direction and that 
none is synchronous with the velocity wI of the travel- 
ing wave. The relative number of electrons in the 
modulated beam in the element z, z+dz with velocities 
between v and v+dy is written as f(z,t,v)dzdv, so that 
po Jo” f(z,t,v)dv is the charge density at z and ¢ and po 


and 


*An additional factor 4 appears here because of the space 
average involved. 
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is the charge density of the undisturbed beam. The 
function f satisfies 


ts] of a 
of ss # cos(wt-+Tz), (4) 
Ot dz dav 


for this implies the conservation of the number of elec- 
trons in phase space. 
Now, put 


f=fo(v)+Afi(z,t,0) +A fo(z,t,0) + si (5) 


where fo(v) is the velocity distribution in the original 
beam. Substituting in (4) and equating terms of the 
same order in \, one finds 


Of: Of: Ofo 
—+ 


v—=—a(z,t) cos(wi+Tz), (6a) 
ot dz Ov 
ts) fs) 
ofs ‘eee t) cos(wt+Tz), (6b) 
at dz Ov 


and so on. To solve these equations f; and fe are 
written in the form 


fi=A cos(wi+Tz)+B sin(wi+Trz), 
fe=C+D cos2(wt+Tz)+E£ sin2(wi+Tz), 


where A, B, C, D, and Eare functions of z, ¢, and v. When 
these expressions are inserted for f; and f2 and similar 
trigonometric terms collected the following equations 
arise: from (6a) 


a @ fo 
(<+—)a+ (w+ I'v) B=—a(z,t), (7a) 
at az dv 


0 6(6@ 
+0 )B— (w+T'v)A =0; (7b) 
ot oz 

from (6b) 


“+0—) C+ (- +e—)p cos2 (wt+-T'z) 
t 


ot az dz 
0. 6@ 

+ ~ +0) E sin2ol+P)—2(ort Pe) 
Ot az 


XD sin2(wi+Tz)+2(w+Tv)E cos2(wi+Tz) 
OA 
= a(z,t) cos(wi+ r|— cos(wt+Trz) 
v 


0B 
+— sin(wi+Tz) ‘ 
Ov 
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or 
0 @ 0A 
(- +0 C= $a(z, of (8a) 
ot daz 
fe] OA 
(- +) D+ 2(w+T'v) E=4$a(z,t)— (8b) 
Ot az Ov 


a a 0B 
(- +o) E- 2(w+T'v)D= fa(z,t)—. (8c) 
t 


Oz Ov 


Eliminating B from Eqs. (7a) and (7b) one obtains 


3 0\2 0 fe ofa fe) 
| (<+2—) + troe|4-—(<+ avon. 
Ot oz Ov \Ot dz 


The right-hand side of this equation is small because 
the rate of change of a is small and one may write 


dfo/av fa 9 
= (<+— Jae. 


(w+Tv)?\ot dz 
1 re) 0 \? 
_ (<4) 4. (9) 
(w+Iv)*\at dz 


It is clear that A may be expanded in a series of the 


form 
A. re) 0 \ 29-1 
—— ee 
(w+TIv)*"\ot dz 


and this will converge rapidly if w+I'v+<0, since 


rs} Q 2n—1 
( +0) a(z,t) 
Oot Oz 


may be chosen to be as small as one desires. In a similar 
fashion it is seen that C, D, and E are also very slowly 
varying functions of z and ¢. Thus, when average energy 
densities and power flows are calculated only the term 
C in f will contribute, all the others being attached to 
trigonometrically varying terms and thus averaging 
nearly to zero. Now C has to satisfy 


a @ a(z,!) 9f df/av sa a 
(- +i )e=" -| ~+0—)a(s) 
Ot az 2 dvl(wtTv)*\dr dz 


+smaller terms 


_aG)Ta / afo/a \ 7a a 
—( =) (-+—)ateo 
2 Lav\ (w+Ts)?/ Nat az 


Afo/av da 


(w+Tv)? dz 

















+-:--. (10) 





Thus, 


@ a 19f dfo/dv 
(ot) eee foo) 
Ot az 4 dul (w+Tv)? 


9 fo/dv a 
~ A(T 0)? = 
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The solution of 


(- +o—)r( t,v) = G(z,t,2), (12) 
ot az 


which tends to zero as ¢ and z tend to — ~ is given by 
P io) 
Peesa)= f G(z—v0,t—0,v)do. 
0 


This may easily be established by changing from the 
variables, z and ¢, to z—vt and ¢ in the Eq. (12). Thus, 
C is given by 


Ofo/dv 
C=- | a*(z,!) 
4 ov (w+Tv)? 


Ofo/dv 
AwtTo)?J, dz 


r 0a" 








(s—vo,lt—a)do, (13) 


and for large z and / 
_ 1Ldf Afo/do 
ares meee 
PR 
Vern ))? se 








im foto e de (14) 


The fact that this expression for C depends explicitly 
upon the form of the function a(z,t) indicates that the 
problem which has been set does not have a solution 
which is independent of the way the field is built up. 
In general, then the stored energy and power flow are 
not uniquely determined by the first-order state of the 
beam. 

Since the stored ac energy density of the beam is 
proportional to $fo*f2v’dv and the ac power flow to 
So" fov'dv, it is convenient to consider },fo*f.v"d», 
which will be written as (v"). Suppose that the two ex- 
treme cases are now considered; first, that in which 
a(z,t) has no z variation and, second, that in which it 
has no ¢ variation. Write (v"); and (v")» for the integrals 
in these cases. Then 


lr”? df 0 fo/dv 7 
wa f g— dv 
43, dvi (wt Tr)? I 

















1 ood v"d fo/dv n r* v""dfo/dv 
on 8 ibe f Anton 
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4J, do Got Tvy 


Pi gt 
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provided fo(v) tends to zero sufficiently rapidly at 0 
and «. In the second case, since 


da? 0a* 1 da* 
—(z—v0,t—0) = —(z—v0) = —- —(z—10), 
Oz Oz v Oo 

one has 


es) 0a’ 1 «© 
lim —(z—vo,t—a)do= -|-2(e-w)| 
o.oo 0 Oz v 0 





1 1 
=- lim a?(z)=- 
g v 
and 
in” é dfo/dv ie dfo/dv 
(or)=- f | leo f go? —___—_ dp 
4J9 = dvL(w+Tv)? 4J, (w+Tv)? 
== f fo falo) “| gr! —|z (16) 
v v. 
dol (w+ Tv)? 


Thus, generally, 
(v)o/{v") = (n—1)/n. 


What is the relation of these results to those obtained 
from the linear theory? The expression for the stored 
energy was derived on the assumption that the field 
had a fixed space periodicity and should therefore 
correspond to (v),. Now, 


, =f = ( v . 
Was] fol) es 
1 ¢” 1 2rv 
= o\v nad d 
| Sol oer = : 
--f fo(®) “| Tv 1 | 
v)— v 
dul (w+Tv)? ~ tT) 


or! g o-( el (17) 
. af Wo = . 


This agrees with the earlier result since the total stored 
energy of the distribution is the sum of the stored en- 
ergies of the separate streams. The signs and nu- 
merical factors are correct by the following considera- 
tions. The energy density is \*m(v*) multiplied by the 
number of electrons per cc. \ is e/mE and the number 
density is —po/e. Thus the energy density is — (epo/ 
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m)(v°)E? and the beam has a volume 6 per unit area, 


giving an energy storage. 
epaw/ms 
eal =f foe ke 
‘i (w+Tv)? 
Similarly the earlier expression for the power flow 


was based upon the field’s having a constant time 
periodicity and might coincide with (v*)2. But, 


9 


== f f—(——)a 
?).=- v)—{ ———_}dv 
; | i dv\ (w+Tv)? 
1 ¢* 20 2rv* 
== so0(——-—__ Jas 
275 (wt+Tv)? (w+Tv)* 


i 7 2wv , 
-3J Jets eee, 


ae :)a (18) 
(o+Toy2) 


This is again in agreement with the earlier result. 

The conclusions to be drawn from this analysis, then, 
are that for a traveling wave field established in a 
variety of ways the stored kinetic energy and power 
flow will be different from one case to another. Further, 
linear theory will give correct results if correctly applied. 

It should be noted that the expressions for (v); and 
(v*)2 indicate that the stored energy or power flow may 
be positive or negative. Negative stored energy implies 
that work was extracted from the original dc beam in 
bringing it to its modulated condition. Clearly a beam 
with negative stored energy is potentially unstable in 
the sense that it could pass to a state of greater modu- 
lation by doing work on its surroundings. (v*), for a 
bean of velocity v is proportional to (w—T'v)/(w+Tv)' 
and if —w/I is written as v,, the wave velocity, it 
follows that 





wiles jf ox 





(v*); is negative then, for positive v,, if v<—v, or 
V> Vy. The second case is the more interesting since the 
negative stored energy can be larger for this case. The 
energy is negative here if the wave velocity is less than 
the electron velocity and positive if the wave velocity 
is the greater. 
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Distinction between data transmission from one person (or point) to another and the inception of new 
knowledge is analyzed in relation to information theory. The customary measure H = —2;p; logp; applies 
essentially to the former case, and after discussing the requirements appropriate to a yardstick for new 
knowledge an elementary measure for this concept is tentatively proposed. The intuitive requirements, 
however, are not all mutually compatible and the present measure can only be regarded as provisional; in 
particular some “memory” characteristics appear mandatory for future development. 





INTRODUCTION 


I* a provocative paper recently, Brillouin’ is led to 
suggest that we may have to consider two differ- 
ent kinds of information. The first he calls: Abso- 
lute Information ‘“‘which exists as soon as one person has 
it, and should be counted as the same given amount of 
information, whether it is known to one man or to 
millions.” The second is termed: Distributed Information 
which he defines as “. . . the product of the amount of 
absolute information and the number of people who 
share that information,” and he goes on to say “. . . 
Distribution of information . . . increases the amount 
of distributed information without changing the total 
absolute information available.” 

Expressed otherwise, it appears to me that Brillouin 
wishes to distinguish, on the one hand, between new 
knowledge and, on the other, the transfer of data from 
one person (or point) to another. The difficulty of 
recognizing and resolving this distinction has indeed 
made itself felt at various times before now, but I 
believe that the correct solution has not yet been found; 
this paper makes a suggestion towards that end. In- 
formation theory, as brought to the fore by Shannon? 
particularly, presents to us the quantity 


H=—X,p; logsp; (1) 


as a measure of the amount of information in units of 
“bits” obtainable from some source, and this seems 
admirably suited (being indeed designed for the pur- 
pose) for the description of the transfer of data within 
some prescribed frame of reference from one point to 
another. It appears however that the measure H is not 
suited for the assessment and discussion of new knowl!- 
edge, and we shall try here to find a more appropriate 
measure for this latter “quantity.” Apart from any 
academic or aesthetic considerations, this can only be of 
ultimate value if we find that such a measure can be used 
numerically to assess satisfactorily our gain or otherwise 
in some change of situation of primary knowledge; and 
this, at heart, must agree qualitatively with our 
elementary intuition. We have seen with great interest 


1 L. Brillouin, Am. Scientist 38, 594 (1950). 


2 C. E. Shannon, Bell System Tech. J. 27, 376, 623 (1948); 30, 
50 (1951). 


Gabor and Gabor’s (1953) essay: “A Mathematical 
Theory of Freedom,” wherein a numerical measure 
has been proposed for the effective freedom of action 
of a population based on a posteriori observation of 
the range of choice exerted by the population. While, 
as the authors themselves emphasize, so sophisticated 
a concept requires great care in any numerical inter- 
pretation, its possible quantitative application to social 
and economic problems such as effective electoral lib- 
erty, for example, appears most attractive. 


THE DISTINCTION BETWEEN DATA 
(“INFORMATION”) AND “NEW” 
KNOWLEDGE 


Let us consider a situation used to illustrate the con- 
ventional field of information theory. We suppose that 
we have a power station together with its generators, 
etc., situated in some isolated place; it is then very 
familiar that we can transmit electrical signals to the 
substation which will start, stop, or reverse the gener- 
ator, control its speed, and so on. This may then of 
course be compared with the process of sending verbal 
orders, or messages, from some transmitting station to a 
receiving post. In addition, however, to complete the 
communication link we should like the generator to be 
able to ‘“‘talk back” to us and this may quite readily be 
achieved in the sense that when the machine has in fact 
responded to a “‘request” from our end to start up, a 
signal can be sent back automatically as soon as, say, 
the armature has attained a prearranged speed. . . and 
so on. The communication link can readily be broadened 
further to permit us to be informed if, say, faults 
develop in the generating system. Thus, for example, 
a lightning surge on one of the circuits would cause an 
over-voltage relay to “trip” temporarily, and a dis- 
tinctive signal would then be sent to us, while, if no 
serious damage had occurred, the relay, on resetting 
automatically, would initiate a second appropriate 
signal. This framework is obviously susceptible to wide 
ramification and we are certainly fully justified in 
saying that we are “in communication” with the sub- 
station and that data (or information in the Shannon 
sense of the word) is being transmitted to and fro. All 
the usual calculations may be made (and applied) of 
channel capacity, optimum coding, etc. 
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However, it appears evident to the writer that so 
long as this communication system is working normally 
and fulfilling its expected function, no essentially new 
knowledge is being created. Thus, in order to design an 
appropriate communication system and make it func- 
tion efficiently, we must have first a knowledge (the 
word now being used primarily in its intuitive sense!) 
of the characteristic functions of the generator, etc., 
and more particularly of the faults which may occur 
therein. 

Given these and their expected relative frequencies of 
occurrence, say p; for the ith “fault,” etc., we can then 
proceed to design our system and use the familiar 
measure of information content per signal sent: 


H = —,p; logsp; (bits) (2) 


to analyze its performance quantitatively. 

If now in any “system”—ranging from the signalling 
of messages by a broken-down motor at one end of the 
scale to the delivery of lectures on, say, quantum field 
theory, at the other—the statistics of the “‘transmitter”’ 
(motor, lecturer) in the sense of the #,’s for all possible 
messages are truly known, then there is really no possi- 
bility of creating any new knowledge in the intuitively 
fundamental sense. All that one can do is to give (send) 
notification of the existence of situations already fully 
visualized and whose statistics are already known. 
Of course the probable trend of events will even 
be known beforehand if the communication system is to 
be used at high efficiency since then we shall have to 
know not merely the individual probabilities of the 
enumerable situations which may occur but also all the 
dependent probabilities: ;;;, pij:x, etc. (which are 
necessary for appropriate coding). 

This last conclusion suggests already a possible re- 
lationship of new knowledge within the established 
framework as opposed to “mere”’ data transmission with 
the fluctuations of,a data-transmission system from its 
most probable (average) behavior. We shall go further 
here, however, and suggest that it is only when the 
already-envisaged or established system is actually 
unable to cope with a situation which presents itself that 
any essentially new knowledge (or “absolute informa- 
tion” to use Brillouin’s terminology) is generated. Thus, 
to revert to our previous examples, a new fault, not 
previously encountered, occurs in our generator for 
which therefore no signal has been provided, or our 
lecturer has to attempt to introduce and define new 
words or to ascribe new meanings to words in fresh 
contexts not previously envisaged by the audience. 

A number of illustrative examples come readily to 
mind in this connection. It appears true to say that the 
science of mathematics only really progresses when we 
are forced to invent new symbols and/or to increase our 
vocabulary of functions and that significant progress in 
piysical science only occurs when we can no longer 
adequately describe the behavior of observed systems 
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within the existing terminology. If indeed the existing 
description proves entirely adequate in some field of 
investigation, we can go so far as to say that further 
research in that field is surely quite sterile. It is perhaps 
unwise to venture further afield, but we can at least 
notice that in the artistic domains, such as literature 
and music, the same trend is obvious when it becomes 
felt that existing modes of presentation are no longer 
adequate. 


A MEASURE FOR NEW KNOWLEDGE 


One can summarize the over-all process of evolution of 
natural science and the increase of knowledge, in terms of 
a “communication system” (see Fig. 1) as follows. 
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Before, say, an experiment we have some conception of 
the physical world in terms of possible “states,” to 
which we may assign probabilities ; in the light of 
our knowledge to date. After the experiment, and/or 
the analytical examination of available data, we shall 
in general make some reassessment of the #,’s in the 
light of our increased knowledge. [Such an interpreta- 
tion has indeed been applied quantitatively before to a 
specific experimental problem such as the analysis of 
received radar data, inevitably distorted to some 
degree by random noise (Woodward ;3 Woodward and 
Davies).‘] In this sense we may of course say that 
the problem of physical science is essentially one in a 
posteriori probabilities (cf. e.g., Fisher,® 1935). 

We therefore ask: if the probabilities before an “‘ex- 
periment” (in its most general sense) are p; and after- 
wards are P; how should we assess numerically our gain 
in knowledge on some suitable scale of units? Let us 
emphasize immediately that: first, it is not our province 
here to state how the ,’s and P;,’s are to be assessed,* 
any more than it is the province of conventional in- 
formation theory to assess, in general, the p,’s of the 
signals in a transmitting source; and secondly, that 
there will be no unigue measure of this kind—all that 
we can seek to provide (as with Shannon’s #/) is an 


*P. M. Woodward, Proc. Inst. Radio Engrs. 39, 1521 (1951). 

4P. M. Woodward and I. L. Davies, Proc. Inst. Elec. Engrs. 
(Pt. III) 99, 37 (1952). 

5R. A. Fisher, The Design of Experiments (Oliver and Boyd, 
Edinburgh, 1935). 

* For this we might turn again, for example, to Fisher (reference 
5), Chap. XI in particular. 
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objective numerical measure on a suitable scale which 
will agree as far as possible with our intuition and then, 
we hope, be useful for further discussions. Finally, our 
measure can only be relative to the situation in hand; 
thus, it would be meaningless to ask whether the state- 
ment: “the atomic weight of potassium is 39.1”: 
represents more (or less) knowledge than: “there is 
approximately one free electron per atom in sodium 
metal” just as it is meaningless. in general to ask 
whether a sonata by Beethoven is “‘better”’ (or “‘worse’’) 
than a picture by Picasso. 

In order, then, to formulate a suitable expression we 
must first suggest the requirements considered desirable 
that AJ—the change in “new information” or ‘‘knowl- 
edge”—should fulfil. 


(1) On the assumption that our “experiments” are 
always “sincere” (i.e. that no deliberately false data is 
inserted), AJ should never be negative. 

(2) AI should be monotonically ordered: that is to 
say, a change of probabilities: (4, 2) to (2, 4), for ex- 
ample, should yield a greater AJ than (3, 2) to (3, 4). 

(3) AI should be symmetrical : that is, a change from 
(3, 3) to (2, 3) should yield the same A/ as the inverse 
change. 

(4) AJ should be linearly additive in the sense that 
the successive changes (}; #)—>(3, 3)—>(%, 4) should 
yield the same result as the single change (3; %) to 
(3; 2). 

(5) A logarithmic basis appears desirable. 

(6) For analytical convenience, we should like to ex- 
clude the modulus sign as an artifice to ensure condition 
(1), and the expression should he as “simple” in form as 
possible. 


It is readily seen that these conditions are not all 
mutually compatible. Thus by conditions (3) and (1) 
the null change (p;;p;)—>(:3;p;) must yield AJ=0, 
while by (1) the successive changes: (p;;p,)—>(p,’ 3p;’) 
—(pi;p,) (where p,’~,) must yield an over-all AJ>0, 
since each change must give a positive AJ, and this then 
evidently contradicts condition (4). A possible measure 
satisfying all conditions except (4), is then: 


Il; 
AI =2 (II;—7;,) log (—) (3) 


where I],;= 1—P;; 7;=1— >; and the z’s may be termed 
the “improbabilities.” The choice of this form rather 
than the more immediately obvious 


P; 
AI==z (P;—p;) log— (4) 
t p ° 


will be justified below. The units will be “bits” if the 
logarithms are to base 2, or “nits’® if natural loga- 


*D. K. C. MacDonald, J. Appl. Phys. 23, 529 (1952). 





rithms are employed. The limitation of our measure for 
AI in (3), in contradicting condition (4), is illustrated 
by comparing on the one hand, successive transitions, say 
(wi1,+ ++ )—>(ai2,- + -)—>(a3,---), with the direct change 
(wi1,-+-)—>(ais,---). In these cases the difference of AI 
is given by: 


6= Al direct— AT sncccesive 


TG 33 
- > (33;— r:dlog(=)+ (w2:—71;) log—}. (5) 


Thi 25 


(It is then obvious that 6=0O if either (22;,---) 
=(m1;,---) or (msi,---)=(qe,--+), reducing the suc- 
cessive transition to a direct one.) Restricting ourselves, 
for simplicity, to the simplest case of dual probabilities 
(a, Ty), Say, it then follows readily that if the transi- 
tions are monotonic, e.g., 32> 2a> Tia, then 6>0. This 
would then suggest that more new knowledge is gained 
in this case by a single ‘‘experiment”’ yielding the transi- 
tion, T1a—73a, than by two successive “experiments.” 
Conversely, a nonmonotonic change (e.g., m3a<72a 
>a) yields 5<0. These implications do not, however, 
seem immediately acceptable to the intuition and indeed 
may well prove untenable. 

Some consideration of this situation suggests that a 
measure which did not merely take account of a single 
change of probabilities, but rather included some 
“memory” of previous transitions would be more nearly 
ideal. In more mathematical terms it appears desirable 
to recognize “increase of knowledge” as a non-Mar- 
koffian process. It seems interesting that we are analyti- 
cally driven to this conclusion since it would certainly 
appear intuitively agreeable that our increase of knewl- 
edge would depend not merely on the immediately 
previous assessment of the situation but on how this was 
earlier approached. 

It is evident from (3) that if any 2; (say m,) becomes 
precisely zero (and hence all other 7,’s become unity), 
AI becomes infinite. This corresponds to the achieve- 
ment of certainty (a “singular point”’) in our assessment 
of the probabilities (p,=1, all other p;,=0) and it is for 
this reason that (3) is preferred to (4), as remarked 
earlier. At first it appeared somewhat surprising that a 
transition, say, (9/10, 1/10)—>(1,0) should yield 
AI=«, but indeed this became intuitively very satis- 
factory because physically such a transition will corres- 
pond to the recognition of some natural law. The 
achievement of a natural law implies a certainty about 
the outcome of all further experiments in this field and 
consequently the infinite assessment is entirely agree- 
able. It is also worth noting, however, that the loga- 
rithmic dependence involves a very slow approach to 
infinity so that even (999/1000, 1/1000) is still “a long 
way” from (1, 0), underlining the importance of the 
vital step to the certainty of the physical Jaw. This 
evidently concords with Brillouin’s appreciation of the 
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apparent informational dilemma involved in the dis- 
covery and publication of a natural law. We may now 
indeed suggest that in this field lies an essential differ- 
ence between the human brain and any “calculating 
machine” or “electronic brain.”” A machine, given an 
accumulation of experimental data, etc., would be able 
to make certain quite precise deductions about the 
probability of further experiments, etc., but only the 
human brain can make the synthetic and inductive 
generalization involved in the statement of a natural 
law. Of course, if one does not wish to admit that there 
is any essential difference between the human brain and 
the machine, one may point to the fact that natural 


MacDONALD 


laws, as they have been enunciated in the past, (e.g. 
Newton’s laws) do not appear to stand the test of time 
indefinitely! . . . The debate continues. 


CONCLUSION 


We may perhaps sum up by saying that any com- 
munication system (in its most general sense) which has 
reached a state of “perfection” where all “redundancy” 
(or lack of efficiency as a conventional communication 
system) has been eliminated is impotent to produce new 
knowledge. While necessity may be the mother of in- 
vention it appears that efficiency is not always the 
father of knowledge. 
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Limiting Pressure on Hydrofoils at Small Submergence Depths 
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It is shown that the magnitude of the local velocity and the surface pressure coefficient can be limited over 
the entire upper surface of a two-dimensional hydrofoil as it approaches the free water surface. In this 
limiting condition the flow over the upper surface is governed by the shallow water wave propagation 
phenomenon, and the limiting pressure or velocity is determined by the maximum depth attainable from the 
hydraulic jump relations. The experimental data verifies this new theory and proves that the previous two- 
dimensional hydrofoil theories are valid only for relatively deeply submerged bodies. 





INTRODUCTION 


HE experimental data shown in Fig. 1 was ob- 
tained by Ausman,! who pointed out the ap- 
parent similarity of the limiting pressure coefficient 
(found on the upper surface of the hydrofoil when near 
the free surface) to the limiting value found when 
cavitation occurs. However, the magnitudes of the cut- 
off or limiting pressure coefficients shown in Fig. 1 are 
much less than those necessary for cavitation (when the 
local static pressure must drop to its absolute minimum 
equal to the vapor pressure of the water’). 
It will now be shown that this limiting pressure over 
a two-dimensional hydrofoil at small submergence 
depths is directly related to the phenomenon of shallow 
water wave propagation. This is physically evident if 
one considers a thin hydrofoil near the surface. Since 
its chord is relatively long as compared to its water 
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depth, therefore, the flow over its upper rear surface 
must be similar to the shallow water flow in an inclined 
channel. This analysis is very important since it verifies 
the experimental data presented which shows that the 
lift of a large chord two-dimensional hydrofoil becomes 
negligible as it approaches the free water surface. This 
is contrary to the prediction of all previous theories 
on the action of a hydrofoil* which are, therefore, in 
error for a finite chord hydrofoil as its submergence 
depth becomes small. 

This free water surface flow resistriction is illustrated 
in Fig. 2 where the experimental data show how the 
maximum negative values of the pressure coefficient 
decrease with the submergence depth once the theo- 
retical limiting value (solid line) has been reached. 
The experimental values presented in Fig. 2 are the 
maximum negative values of the pressure coefficient 


-2 EQ (7) 








PERCENT CHORD 
20 40 





pressure distribution on NACA 0 





60 











4412 profile at various sub- 
mergence depths at constant ra! 
forward speed, V..=2.93 ft/sec, iE 

Reynolds number=1.4X105, 
a= 10°, c=} ft, h=1.86 ft. 





a 


vo 1/M%, = 2.01 



































_£Q.(7) ’ 
Cc PERCENT CHORD —— 
‘ 20. 40 ~#60 Cp ie, 
o oo —= —<— ) ~~» + ¢ “ 
20 40 60 80 100 
t/uzy= 1.3! 1/Mz= 0.68 


* Associate Professor, Mechanical Engineering. 


1 J. S. Ausman, “Experimental Investigation of the Influence of Submergence Depth Upon the Wave-Making Resistance of a 
Hydrofoil.” Master of Science thesis, University of California, Berkeley, (1952). 

?H. Rouse, Fluid Mechanics for Hydraulic Engineers (McGraw-Hill Book Company, Inc., New York, 1938), pp. 388-390, 129, 369. 

3M. V. Keldysch, and M. A. Lavrentiev, “On the Motion of an Aerofoil Under the Surface of a Heavy Fluid.” Tech. Notes of the 
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Fic. 2. Comparison of limiting pressure coefficient theory with 
experimental data at varying water flow and varying depth. Hori- 
zontal asymptotes are from wind tunnel tests (Natl. Advisory 
Comm. Aeronaut. Tech. Rept. 613) at average Reynolds number. 
Definition of y. and a typical surface profile are shown below. 
Ve<v/gh. 


found on the upper surface of a hydrofoil at various 
submergence depths and angles of attack. The limiting 
values always occurred over a major portion of the 
upper surface, in the same manner as depicted in Fig. 1. 


SYMBOLS 


a=angle of attack of hydrofoil, measured from chord 
line joining trailing edge to furthermost point of 
leading edge. 

p=62.4/32.2 lb sec?/ft?, mass density of water. 

g=32.2 ft/sec?, gravitational constant. 


? 


M =——-, local Froude number. 
Vey 


V =ft/sec, local velocity. 
y=ft, local water depth. 
p=lb/ft?, local static pressure. 


Pe 





yoclile ; 
C,= , local pressure coefficient. 
V 2 


3eV 


c=ft, hydrofoil chord line length 
h=ft, depth of channel containing hydrofoil. 


SUBSCRIPTS 


0=stagnation conditions from Bernoulli equation. 
1=limiting value, or value ahead of hydraulic jump. 
2=value after hydraulic jump. 


LAITONE 


o =free stream conditions, y.. being defined as depth of 
trailing edge below free surface. 
*=critical condition where V,=+/gy,. 


THEORY 


If we have a large chord thin hydrofoil at small 
angles of attack, then as the hydrofoil approaches the 
free surface of infinitely deep water, we are justified in 
assuming that all the flow at and aft of the critical 
depth over the upper surface follows the hydrostatic 
relations; that is, the centrifugal or vertical accelera- 
tions are negligible so the velocity at each section is 
uniform over the entire water depth. The critical depth 
is defined the same as in open channel flow ;? namely, 





V2 
I.” or V,=V/gy,, (1) 
g 
so that the Froude number defined by 
Vy? 
w=— (2) 
gy 


is unity at the critical depth (y,). 

Immediately aft of the critical depth, the flow will 
always be supercritical or rapid (M>1) as long as the 
hydrofoil is at a positive angle of attack; consequently, 
the only means of recovering the pressure or water 
depth is by means of a hydraulic jump. The hydraulic 
jump” reduces the velocity and increases the depth 
(vi1<y, to y2>¥,.) by converting the flow from super- 
critical to subcritical with some loss of energy. The 
ratio of the water depths is given by’ 


2 
7? = 3C(14+89f,)!—1] (3) 
¥1 
for M;> 1. 

It is seen that y2=y, for critical flow (M,=1) and 
that y2/y, increases without limit as Mj, increases. 
However, the absolute value of y2 is limited to a maxi- 
mum of 0.8 yo, where yo is the upstream total head 
(equivalent to the isentropic stagnation conditions 
ahead of the jump) defined by the Bernoulli equation 
for hydrostatic flow as, 


V;? 
nt— 
Yo 2g M;? 
—s =1+—. (4) 
V1 V1 2 


- 





Then if we solve the equation 
d(y2/‘yo) - 
d(M,’) 


? 


we find the maximum value of y2/yo occurs when 


My =3, giving (y2/¥0)max= 0.8. 
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Corresponding to these conditions we have (using the 
relations given in reference 2) 


yi/yo=0.4, —-yo/r=2=V1/Vo, 
M?= 2 Ayo/yo= —0.05. 


The last expression shows that the loss of total head is 
only 5 percent, so the flow is practically a Bernoulli 
flow without any losses. 

Bakhmeteff* has shown that for /@,=Vv3 there is no 
abrupt jump, and when the actual rise in inches is small, 
the whole surface wherein the so-called hydraulic jump 
takes place may be smoothly continuous for a length 
slightly longer than 5 times the actual rise. The jump is 
actually unstable for M,<v3, and forms a very abrupt 
discontinuity for M,>v3 (see p. 249, reference 4 or 
p. 389, reference 2). When M,>v3 the energy loss 
increases rapidly with M, and becomes extremely large 
(see p. 390, reference 2). Bakhmeteff* also emphasizes 
that experimentally the small slopes cannot produce 
the larger hydraulic jumps corresponding to M,>Vv3. 

Therefore, if the flow over the upper surface of a 
hydrofoil has become hydrostatic as it approaches the 
free water surface, it must be limited by the following 
conditions: 


M/=3, yi/Vo= 3 N1/Yoos (S) 


where y2 is the depth after the weak hydraulic jump 
from the limiting depth y:, while y.. is defined as the 
depth of the trailing edge below the original free water 
surface as shown in Fig. 2. In general, because of the 
loss of total head, yo<y.. The largest possible value of 
yo(*¥.x) is provided by M,=V3, that is, by Eq. (5). If 
M,>Vv3, then ye is considerably decreased and the free 
stream water level behind the hydrofoil can force the 
hydraulic jump upstream until M,=v3 and y2—ye. 
However, this is only true for a hydrofoil in infinitely 
deep water since wave disturbances cannot be propa- 
gated upstream to the hydrofoil if V.>+/gh (where h 
is the finite depth of the channel containing the hydro- 
foil). In this latter case it is not even necessary to have 
a hydraulic jump. 

When hydrostatic conditions have been attained, the 
pressure coefficient may be written as 


P— Po 28 Yn f ¥— Yo 2/9 
C,= - ai =i). © 
pVw7/2 VIN Ye M7 \ Vu 
V.. being defined as the velocity of the hydrofoil rela- 
tive to the water. Then if y, is again defined as the 


depth of the trailing edge below the original free water 
surface, Eqs. (5) and (6) give a limiting value of 





1 V0 
Cie <ncentininon (7) 


M. Vie 


This provides a rigorous limiting value since the actual 


4B. A. Bakhmeteff, Hydraulics of Open Channels (McGraw- 
Hill Book Company, Inc., New York, 1932), pp. 229, 249-252. 


depth y2Sy, at the trailing edge, while the velocity 
there is always greater than V., especially for thick 
hydrofoils or large angles of attack. In these latter 
cases, however, the vertical accelerations may be so 
large that hydrostatic conditions are never established 
and, therefore, Eq. (7) cannot provide a satisfactory 
limiting value. 

Since hydrostatic conditions are never established 
near the leading edge, only near the critical depth which 
must occur aft of C,=0, y. must be defined at the 
trailing edge. Defining y.. in this manner also satisfies 
the Kutta-Joukowski condition, preventing an infinite 
viscous force at the sharp trailing edge by requiring 
that the pressure recovery be identical for the upper 
and lower surfaces. When yo2=y.. Eq. (6) gives C,=0 
for the trailing edge, agreeing with the experimental 
data of Fig. 1 for the larger submergence depths. As 
the submergence depth decreases, yo<y. and C,<0, 
again in agreement with Fig. 1. 

Figure 2 proves that Eq. (7) is in excellent agreement 
with the experimental data. Further justification of 
Eq. (7) and the assumption of the hydrostatic relations 
were obtained by measuring the water depths and find- 
ing them constant over the limiting flow portion of the 
hydrofoil, and agreeing with Eq. (5) after the jump. 

Also shown in Fig. 2 is the absolute limit for the 
pressure coefficient following the hydrostatic assump- 
tion, when no jump is formed and the flow velocity 
increases indefinitely until y approaches zero so that 
Eq. (6) reduces to 

- 28 


Co,—0= ois = 


oe 
M.2 V2 








(8) 


If Eq. (8) is exceeded, it proves that the hydrostatic 
relations are not attained and the vertical accelerations 
are not negligible. 

The other critical line shown in Fig. 2 is the one that 
must be exceeded to produce air entrainment, provided 
the hydrostatic relation is applicable. This condition 
can be derived from the momentum change required to 
reduce the net pressure to atmospheric, so that 


v 
f pdy—O= pgy’/2=pVy(Vi—V), 
or ° 
1 
y/n=1+—, (9) 
2M? 


since 
yV = yiVi. 


Then, if we further assume that the initial conditions 
are y=, M=M.,, (not necessarily true, since hydro- 
static conditions may not be reached ahead of the 
critical depth), Eqs. (6) and (9) yield 


C oe ) (10) 
oe M2\14+2M2) 
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Even if Eq. (10) applied, and hydrostatic conditions 
held, it could be exceeded before serious air entrain- 
ment and flow separation occurred because the hydrofoil 
surface is very smooth and the curvature is small. 

It is obvious that if hydrostatic flow conditions are 
not established, then Eqs. (7), (8), and (10) are not 
applicable, and in this case no limiting value can be 
predicted. This adverse condition would occur for very 
thick hydrofoils or large angles of attack because the 
severe vertical accelerations produced in these cases 
would preclude the possibility of ever establishing 
hydrostatic conditions. Another possibility invalidating 
Eq. (7) would be to have the average velocity corre- 
spond to M,>v3 before the vertical acceleration became 
negligible and hydrostatic conditions were established. 
However, this seems to occur only with a hydrofoil in a 
finite depth channel when V,.>1/gh so no wave dis- 
turbances can be propagated upstream, and therefore 
Eq. (7) is not applicable, as previously pointed out. 

It is easily shown that when yo= ye, 


8M? 
M.2= , (11) 
[(1+8M 2)!—1} 





which is of course the relation between the Froude 
number after the hydraulic jump (M,,”) to the upstream 
value (M,*). Equation (11) shows that 


M2S3 for M223. (12) 


This further proves the applicability of Eq. (7) as long 
as the hydrofoil moves in infinitely deep water, since 
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Eq. (12) shows that values of M,?>% cannot have 
M,>VN3 and still have y2—y.. 

It must be noted that the relations used to derive 
Eq. (7) are essentially for horizontal channels, since 
Bakhmeteff! has proved that the hydraulic jump rela- 
tions are satisfactory for predicting results only in 
channels sloping downward less than 4 degrees. How- 
ever, Eq. (7) itself may be expected to hold for greater 
angles than this, as shown in Fig. 2, because (yea—1)/¥1 
is practically independent of the channel slope (see p. 
252 of reference 4). The limiting pressure coefficient 
only depends on this value which then may be taken as 
} for all angles of attack between zero and 10 degrees. 
Visual observation made by the author indicated that 
Eq. (7) was definitely not applicable for angles of 
attack approaching 18 degrees. The indication was that 
channels sloping at 18 degrees or more actually could 
not support a weak hydraulic jump (M,=v3) but could 
only maintain a very abrupt strong hydraulic jump. On 
the hydrofoil this strong jump would occur behind the 
trailing edge and produce an energy loss of over 50 per- 
cent of the total head. In this case, the Kutta-Joukowski 
condition can be violated ; there is practically no lift but 
there is very large pressure drag, and Eq. (7) is of course 
not applicable. 

Also, as is well known, small span hydrofoils approach 
a lift force at the surface equal to one-half the lift force 
at great depths; therefore, small span hydrofoils are not 
limited by Eq. (7). This is also obvious from a physical 
consideration of the span-wise flow which makes the 
hydrostatic assumption impossible for a small span 
(low aspect ratio) hydrofoil. 
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ve Fast Time Analysis of Intermittent Point-to-Plane Corona in Air. III. The Negative 
| Point Trichel Pulse Corona* 
ive 
ice M. R. Amint 
la- Department of Physics, University of California, Berkeley, California 

in (Received September 21, 1953) 
Ww - 
son Fast oscilloscopic time analysis of the negative point Trichel pulse corona in room air at various pressures 
} and gap geometries reveals the following data. The very short rise and quenching time of the pulse at atmos- 
/M1 pheric pressure observed by English is confirmed. Under these conditions the secondary action is a photo- 
?*p. electric liberation from the cathode and discharge extinguishes by dissociative attachment to give O™ ions. 
ent Decreasing pressures reduces space charge density, prolongs the discharge and brings in a secondary libera- 
as tion by positive ion bombardment. Increasing potential at constant pressure at first decreases clearing time, 


leaving pulses unchanged and current increases proportional to repeat rate. At higher potentials, photon 





eS action is reduced relative to positive ion impact at the cathode, the discharge extends further into the gap, 
hat and quenching is very effective with less total ions reducing pulse size. With increasing repeat rate, current 
of no longer increases proportional to repeat rate in consequence of smaller pulse height. The effects of in- 
hat creased point diameter are increased pulse size. Comparable clearing times for large and small points require 
uld higher potentials for the former. Thus, at constant pressure and potential, increase in point radius produces a 
decrease in frequency and increase in total charge per pulse. Conditioning of the point at high-current 
uld densities and repeat rates decreases the number of ions per pulse, which is of the order of 10° under normal 
On circumstances, and shortens the pulse by reducing cathode work function and increasing photoelectric 
the liberation, so that choking occurs with little ion action. Since repeat rate is slightly decreased by the charge 
er- liberation, the reduced pulse charge yields a reduced current through a conditioning that decreases work 
oki function. Comparison of ion movement across the gap during clearing time, with that for O.* ions of known 
SKI mobility in air, indicates that the rapid ion transit in Trichel pulses is caused by a majority of O~ ions with 
but a mobility of about 4 cm*/volts second from near the cathode with some formation of O2~ ions in transit. 
irse 
ach S the potential of a negative point is increased, the at atmospheric pressure, the data for time scale analysis, 
na threshold for a self-quenching intermittent dis- so far, have been inadequate. English’s photomultiplier 
mr charge is reached where the current increases abruptly showed a surprisingly short time (4X 10~ second), for 
Ps from 10—" to about 10~* to 10-7 amp. At the threshold the duration of the Trichel pulse. The existing mech- 
woe the pulses are random and irregular in time, but with anisms could not account for this, until 1952, at which 





further increase of voltage they become quite regular 
in time. In fact they are so regular that they resemble 
relaxation oscillator pulses. Raising the potential more 
causes an increase in the current as well as the repeat 
rate of the pulses until a pulseless discharge is reached 
at about 10° cycles per second. Visually, the pulsating 
discharge appears as a typical glow discharge close to 
the point. There is a Crookes dark space, a bright 
negative glow, a Faraday dark space, and a fan-like 
positive column. Such a discharge generally appears in 
air or electron-attaching gases. 

This type of intermittent discharge was first observed 
by M. O’Day, but was studied in detail by Trichel,' and 
interpreted by L. B. Loeb. A comprehensive analysis 
and discussion of the early work was presented in an 
article by L. B. Loeb? in 1948. For a recent systematic 
study of current-voltage characteristics in standard 
geometry, the reader is referred to the paper of H. W. 
Bandel.* 

Except for the oscillogram of the photon pulse taken 
by W. N. English‘ for a single geometrical arrangement 

* This work was supported for one year on funds from the U.S. 
Office of Naval Research and for a second year under a fellow- 
ship from the Research Corporation of America. 

T Now at the University of Oklahoma, Norman, Oklahoma. 

1G. W. Trichel, Phys. Rev. 54, 1078 (1938). 

2 L. B. Loeb, J. Appl. Phys. 19, 882 (1948). 


3H. W. Bandel, Phys. Rev. 84, 92 (1951). 
4W. N. English, Phys. Rev. 77, 850 (1950). 


time L. B. Loeb’ provided the explanation. According 
to this mechanism, the pulse builds up in 10~* second 
by the formation of successive avalanches as the result 
of secondary electron emission from the cathode. 
Since the rise time of the pulse is so short, this second- 
ary action can be accounted for only by photoelectric 
effect at the cathode. Using the newly observed cross 
sections for dissociative attachment of electrons to O» 
molecules forming O~ ions at about 3 ev of energy 
(Geballe and Harrison®), Loeb was able to explain the 
fast time of choking. Thus, the formation of O- ions 
causes an accumulation of space charge near the point 
which quenches the pulse in some 2X 10-* second. 


OBSERVATIONS} 


The equipment and techniques used were those in- 
dicated in an earlier paper.’ As in the previous papers,’ :® 
the reproducibility and generality of the results justify 
the presentation of only a few illustrative oscillograms. 

Figure 1 trace B illustrates the short duration of the 
Trichel pulse at atmospheric pressures. The square 


5 L. B. Loeb, Phys. Rev. 86, 256 (1952). 

6 M. A. Harrison and R. Geballe, Phys. Rev. 84, 1072 (1951). 

t Oscillograms in Fig. 1 were obtained, using type 513D Tek- 
tronix. All other oscillograms were made from the traces of 511D 
Tektronix, and that is why the spikes in these pictures are broader 
than those shown in Fig. 1. 

7M. R. Amin, J. Appl. Phys. 25, 210 (1954). 

8M. R. Amin, J. Appl. Phys. 25, 358 (1954). 
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Fic. 1. Upper: 1X107* 
sec test pulse. Lower: Tri- 
chel photon pulse. 2.0-mm 
diameter point. 4-cm gap. 
Atmospheric pressure. 





test pulse of 1X10-* second, which was sent through 
the same system in Fig. 1, trace A, shows the short time 
resolution of the circuit. Figures 2, 3, and 4 show typical! 
photon and electrical pulses as the different variables 
are changed. Figure 5a shows the regularity of Trichel 
pulses while Fig. 5b provides a good example for com- 
parison of photon and electrical pulses. 

There are certain very interesting features. At lower 
pressures and/or higher fields the initial spike is 
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Fic. 2. Variation of Trichel pulse with repeat rate. 1.0-mm 
diameter point. 4-cm gap. Pressure= 100 mm Hg. RC=10°° sec. 
Sweep rate: 0.5 yusec/scale division. 


followed by a plateau. This is seen in both the electrical 
and the photon pulses. The photomultiplier, seeing 
only the luminosity, does not record the movement of 
the ions in the high field region. The electrical pulse, 
however, records this motion. Therefore, the ratio of 
the height of platéau to its spike is less in the electrical 
pulse than in the photon pulse. Moreover, at the time 
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Fic. 3. Variation of Trichel pulse with pressure. 1.0-mm diam- 
eter point. 4-cm gap. Repeat rate= 100 kc. RC=10~* sec. Sweep 
rate: 0.5 usec./scale division. 
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when the photon pulse drops rather sharply to zero, 
the electrical pulse shows merely a sudden break and 
with a change in slope gradually approaches the base 
line, Fig. 5b. This tail is a good evidence of ion motion 
in the high field region, since the electrons having a 
much higher mobility have been removed by this time. 
If this tail is extrapolated back to the earlier part of 
the discharge, then it appears that the relative shapes 
of electrical pulse caused by ionization resemble the 
photon pulse quite closely, Fig. 5b. Thus, ionization 
parallels excitation. In order to study the motion of 
negative ions in the low field region, reference is made 
to Fig. 6, where a large resistance (R= 10° ohms) was 
used for this recording. For comparison the positive 
burst pulse is illustrated in the same figure. 

Figure 7 shows the change in pulse shape as a result 
of the “Conditioning” (observed by English? and 
Bandel*) when the current reaches values near the 
transition to pulseless discharge. It is to be noted that 
the spikes are higher and the tails are shorter for a 
conditioned point. The conditioning also causes a 
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* Fic. 4. Electrical Trichel pulses for two different sized points. 
4-cm gap. Pressure= 200 mm Hg. RC=10-* sec. Sweep rate: 0.2 
usec/scale division. Left oscillogram: relative gain=1.0. Right 
oscillogram: relative gain = 2.0. 


change in frequency-current and frequency-voltage 
plots, Figs. 14 and 15. 

The data on pulse frequency against current and 
frequency against potential are shown for several pres- 
sures and points in Figs. 8, 9, 10, 11, 12, and 13. Other 
information and characteristics like current against 
voltage and number of charges per pulse, can be ob- 
tained directly from these plots. Unfortunately, these 
plots cover a short range because of the following 
limitations. 

1. Ultraviolet illumination of the point as a source 
of triggering electrons was not possible, since the ultra- 
violet source produces visible luminosity which gives 
spurious results and adds a high background noise to 
the photon pulse. Thus, studies could not be made at 
the Trichel pulse threshold. The appearance of multiple 
and diffusive spots at lower pressures and high po- 
tentials, and the limitation of voltage output of the 
power supply at higher pressures, prevented the study 
of higher frequencies. 

2. At lower pressures (20 and 40 mm Hg for 0.5-mm 


*W. N. English, Phys. Rev. 74, 170 (1948). 
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point) the frequency-current plots reached a maximum 
at about 130 microamperes, and then declined rapidly 
until the pulseless discharge was reached. In such 
cases the negative glow and other characteristics of 
the glow discharge disappeared, and a diffusive glow 
covered the whole point. Since such phenomena do not 
have the characteristics of Trichel pulses, their study 
was not pursued any further. Thus, the observation 
was limited to such pressures where the Crookes dark 
space, negative glow, etc., could be distinguished. 

3. Since studies of Loeb indicate the fan of purple 
glow to result from movement of the steep potential 
front of the negative ion space charge that choked the 
discharge out into the gap after the luminous part of 
the glow is largely over, an attempt was made to ob- 
serve this movement. However, the luminosity relative 
to the noise background was too weak to record and 
observe this movement. 
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Fic. 5a. Regular electrical Trichel pulses. 1.0-mm diameter 
point. 4-cm gap. Pressure=100 mm Hg. RC=10-* sec. Sweep 
rate: 5 usec/scale division. (Upper photographs.) 


ANALYSIS AND INTERPRETATION OF THE 
OSCILLOGRAMS 


First, an outline of the mechanism will be given and 
then it will be used to explain the interesting behavior 
of Trichel pulse as different variables change. 


1. Mechanism 


An electron emitted from the cathode under proper 
field conditions forms an avalanche. The photons 
created in this process strike the cathode liberating 
secondary electrons by photoelectric action (yp). The 
secondary electrons form a number of successive 
avalanches which give rise to a very rapidly growing 
pulse.>-§ By dissociative attachment the outwardly 
moving electrons form a space charge of O~ ions very 
near the point which when sufficient chokes the pulse 
(Loeb mechanism).° If by the time the positive ions reach 
the cathode the field is still reduced, the secondary elec- 
trons created by positive ion bombardment (y7) are not 
capable of forming any more excitation and ionization. 
Thus, the discharge chokes completely until the space 
charge has cleared and another pulse can start. On the 
other hand, if upon the arrival of positive ions at the 

§ The positive ions formed by the avalanches are created in a 


dense cloud at some distance from the point. These ions move into 
the cathode slowly. 








Electrical Pulse — 


Photon Pulse. 





Fic. 5b. Electrical and photon Trichel pulses. 1.0-mm diameter 
point. 4cm gap. Pressure=50 mm Hg. RC=10-* sec. Sweep 
rate=0.5 yusec/scale division. (Lower photographs.) 


point, the reduced field has not developed because of 
insufficient density of electron attachment (lower 
pressures and/or higher fields) the secondary electrons 
liberated are capable of forming more avalanches, and 
the discharge continues by 77 action until enough space 
is formed to choke the pulse completely. It must be 
pointed out here that when the positive ions become 
active in secondary electron emission, the dicharge will 
be restricted to a narrow and dense region along the 
axial direction in comparison to the discharge with 
photon action. Such a constriction of the discharge is to 





Fic. 6. Electrically induced pulses. 1.0-mm diameter point. 4-cm 
gap. Atmospheric pressure. R=10* ohms. RC=10~ sec. Sweep 
rate: 1000 yusec/scale division. Upper: burst pulse at 7.6 kv. 
Lower: Trichel pulse at 8.2 kv. 


be expected, since the field is highest along the axis, 
and positive ions are not widely dispersed as are the 
photons. 

From the foregoing discussion, therefore, one would 
expect the space charge formed by y# action to be more 
nearly in the form of a hemispherical shell (because of 
the diffusive photon action) and the space charge 
accumulated by the yi action (because of constricted 
discharge) to be in a narrow and dense region in the 
direction of axis. 
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Fic. 7. Electrical Trichel pulses. 1.0-mm diameter point. 4-cm 
gap. Pressure=100 mm Hg. RC=10-* sec. Sweep rate: 0.5 
usec/scale division. Left oscillogram: relative gain=1.5. Right 
oscillogram: relative gain= 1.0. 
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Fic. 8. Frequency versus current for Trichel pulse corona. 


2. The Effect of Changing the Pressure 


At high pressures the X/p (ratio of field to pressure) 
at onset is slightly lower than at lower pressures. The 
mean free paths are short. With high pressures at low 
X/p, excitation is high, yielding dense photon liberation 
near the surface. The dense photon action produces a 
rather heavy burst of current through a photoelectric 
(yp) secondary mechanism.°* At high pressures the dis- 
sociative attachment occurs in a relatively short dis- 
tance beyond the positive ion space charge cloud 


, 





100 MM H 
400 ° 





” 

a 

© 300r 
% 

z 

5 200+ 

z 

w 

2 

fs 

= 

— 0.5 MM DIA. NEG. POINT 


4 CM GAP 


8 





1 | l 


° 80 120 
CURRENT IN p AMPS 








Yc. 9. Frequency versus current for Trichel pulse corona. 


AMIN 


producing a negative ion space charge of high density 
close to the point. The field from this rapidly quenches 
the discharge. Thus, the pulse rises very sharply and 
declines rapidly, Fig. 1 trace b. 

Decreasing the pressure results in diffusion of photon 
production, a reduction of photon density, and, there- 
fore, a decrease in the number of photon striking the 
cathode. Decrease of the molecular density reduces the 
density of the negative ion space charge formation by 
dissociative attachment, and projects it further into the 
gap. This requires that more ions be created to yield 
fields that quench the discharge. Thus, the number of 
ions per pulse will increase, Figs. 8-10. The rate of ion 
production being decreased and the required ions for 
quenching being increased, a longer time for the 
buildup of space charge for quenching is necessary. In 
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Fic. 10. Frequency versus current for Triche! pulse corona. 


this longer time the positive ions reach the cathode in 
appreciable number. The space charge is then provided 
by the ionization resulting from the very efficient 
secondary electron emission as a result of positive ion 
impact at the cathode. The effectiveness of yi action at 
lower pressures is also increased by the higher positive 
ion energies and drift velocities. 

The effect of these conditions is clearly shown in the 
oscillograms of Fig. 3. Thus, with decreasing pressure, 
reduction in the height of the spike (photoelectric 
action yp), and an increase in duration of plateau 
(positive ion action yi) are observed. Obviously, the 
clearing time between pulses will decrease relative to 
those at higher pressures under equivalent condition, 
because of. the higher negative ion mobility. Repeat 
rates will thus be higher for equivalent potentials, Figs. 
11-13. 
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3. The Effect of Changing the Potential 


As the potential is increased beyond the threshold, 
first the frequency rises linearly with current and then 
the current grows more slowly than the repeat rate. 
This indicates a constant total number of charge per 
pulse for the linear region and a decrease in total charge 
per pulse at higher currents and voltages, Figs. 8-10. 

It appears that the increase in potential above 
threshold does not affect the fields across the active 
discharge region. This lies in the inhibiting effect of the 
space charge of the negative ions. Increasing potential 
merely acts to clear the negative ion space charge more 
rapidly. Therefore, the frequency and current rise 
nearly linearly with potential (Ohm’s law regime). In 
this region only the yp action is effective and when the 
positive ions arrive at the cathode, the field is stili 














0.25 MM DIA. NEG. POINT 
c 4 CM GAP 
” 
a 
© 
8 sook 
z 
> 
oO 
Pe 
2 200 MM 400 
4 200Fr “s 
< 758 
be 
100F 
i l 
° 5 10 iS 


KV 


Fig. 11. Frequency versus potential for Trichel pulse corona. 


quite low owing to the presence of negative space 
charge. Thus, the positive ion secondary action is in- 
effective. 

With further increase in voltage, marked changes in 
mechanism take place since the increase in potential 
affects the active discharge region. As the result of in- 
crease in field, the negative ion space charge is formed 
further away from the point. The positive ions enhance 
the fields of the cathode. The ion mobilities are higher, 
and, thus, secondary electron liberation by positive 
ion bombardment at cathode is more intense and begins 
earlier and lasts longer. With effective ionization 
further out in the gap, at higher fields photons are lost 
to the cathode by diffusion. In consequence, the pulses 
will have a longer plateau and a smaller spike, Fig. 2. 
The reduction in the height of the spike is due to a 
decrease in yp action, and the elongation of the tail is 
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Fic. 12. Frequency versus potential for Trichel pulse corona. 


due to the fact that a longer time is required for 
quenching the discharge. 

‘As was mentioned in the discussion of the effects of 
lowered pressure, whenever the positive ions are 
active, the discharge is restricted to a narrow and 
dense region along the axis; that is, increased in length 
along the axis but constricted radially. In the re- 
stricted volume of ionization, the space charge density 
of negative ions formed is great enough to quench the 
discharge with a smaller total number of negative ions 
than under conditions where the discharge spreads 
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radially over a greater area of the point, even though 
quenching requires a longer time. Thus, the decrease 
in the total charge per pulse is explained, although yi 
action is enhanced. 

Increase in frequency of pulses with increase in 
potential is expected since it is predominantly the dead 
time,|| or time of adequate removal of the space 
charge, and not the time of quenching which determines 
the frequency. When the dead time becomes very 
short, so that the space charge is removed as fast as it 
is formed, a pulseless Townsend-type discharge is 
achieved. 


4. The Effect of Changing the Geometry 


As has been pointed out by Trichel' and as was ob- 
served here, the variation of gap length is not a signifi- 
cant factor in these studies. However, the change in 
point radius is of interest, Fig. 4. The more efficient yp 
action for the larger point (higher spike) is to be ex- 
pected, since it has a larger area for emission of second- 
ary electrons. Also, the strong field extends further into 
the gap and more ions are needed to choke the pulse. 
In order to obtain the same’frequency at a given pres- 
sure the larger point would require a higher voltage, 
since the dead time is dependent on the value of X/p 
at the point; hence, keeping the potential and pressure 
constant, an increase in point radius causes a decrease 
in frequency and an increase in total charge per pulse. 


5. The Effects of “Conditioning” 


Running the discharge for some time at higher fre- 
quencies and/or extending potentials to the region of 
the puiseless discharge causes some changes in the 
characteristics of Trichel pulse corona. Such changes 
were reported by English,® Bandel,? and C. G. Miller," 
but were not understood. Figure 7 illustrates this effect. 
It has been termed “conditioning” in the past without 
knowing exactly what it implied. It must be pointed 
out, however, that the conditioning was not due to a 
change of the geometry of the point, since after a day 
or so the point nearly recovered its original character- 
istics. 

The increase in spike height and reduction in plateau 
and total duration here observed, Fig. 7, at once permit 
an explanation of the conditioning process. It is clear 
that the increased spike implies a very great increase of 
yp, the photoelectric secondary emission. In conse- 
quence, as at high pressures, the very heavy current 
quickly produces enough negative ions near the point 
to choke off the discharge rapidly. That is, a high yp 
yields such a current that before many positive ions 

|| The dead time here is the time required to remove the space 
charge far enough from the point so that another pulse can start. 
It is essential to distinguish the dead time-like intervals for move- 
ment of local space charges far enough from the point to permit 


new pulse formation from the clearing time-like intervals in- 
volving clearing of space charge more or less completely from the 


gap. 
© C. G. Miller and L. B. Loeb, J. Appl. Phys. 22, 614 (1951). 
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have moved in to give a yi plateau, the discharge is 
quenched. This can only happen if the work function of 
the cathode has decreased, which can yield a very large 
increase in yp. The rapid choking by heavy current 
increases the space charge close to the point in a short 
time, so that negative ions accumulate faster than they 
are drawn away. In comparison to a slower buildup of 
the pulse, this means that less total ions are needed to 
choke the pulse. Thus, the number of ions per pulse is 
decreased when conditioning occurs, Fig. 14. 

Since the photon-conditioned yp predominates as a 
source of current, and by its nature produces the 
negative space charge close to the point and radially 
spreads over it shortening quenching time, it will also 
increase clearing time slightly. That is, in contrast to the 
removal of longer column of choking negative space 
charge produced by vi, the ion distribution around the 
point with yp action requires somewhat more time for 
removal. It should be pointed out that English, Bandel, 
and Miller had remarked on the changing of negative 
glow from a broader spot to a narrower, deeper spike 
under conditions where we now know yi to have been 
active. Thus, the conclusions drawn are in conformity 
with visual manifestations. In consequence of con- 
ditioning with decreased work function of the cathode, 
the mechanism is altered, spreading the discharge 
radially, increasing photon action, decreasing yi 
action, decreasing quenching time and charge per 
pulse, and decreasing frequency at a fixed potential as 
seen in Fig. 15. 


6. Comparison of Trichel Pulse with Burst Pulse 


Using a large resistor in series with plane enables 
one to observe the motion of ions (positive or negative) 
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across the gap. Figure 6 illustrates such observations 
for the positive burst pulse trace @ and negative 
Trichel pulse trace 6 under exactly the same condition, 
except that the potential for burst pulse was somewhat 
less than that for the Trichel pulse. The inductive kick 
signifies the time of occurrence of the pulse. This spike 
is followed by a plateau indicative of the motion of the 
ions in the low field region. The motion of the ions in 
the high field region is not registered because of the 
long RC time constant of the circuit (RC = 10~ second). 
From the comparative study of the two types of 
pulses, the following interpretations were made. 

It is to be noted that the positive ions of the burst 
pulse (time of formation 5X10~ second) have a time 
of crossing of about 5000 microseconds. This is indi- 
cated by the break in the plateau of the upper trace. 
Following the break, the trace approaches the zero 
position gradually in some 10~ second. This is because 
of the long time of formation of the burst pulse (5X 10~ 
second) and diffusion of ions. The positive ions are 
identified as O.* ions, since the work of R. N. Varney" 
has shown that the O* ions, having a higher ionization 
energy than O,* ions, cannot exist under the conditions 
of this experiment, and lose their charge to form O.+ 
ions by exchange forces. According to Varney, the 
mobility of O2*+ ions in oxygen at atmospheric pressure 
is 2.25 cm?/volt second. The mobility of O.* ions in air 
so far has not been measured accurately. However, 
from the above value and the early investigations on 
mobilities of positive ions in air, the mobility of O.* 


"R. N. Varney, Phys. Rev. 89, 708 (1953). 
2 L. B. Loeb, Fundamental Processes of Electrical Discharges in 
Gases (John Wiley and Sons, Inc., New York, 1939), pp. 1-31. 
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ions in air at atmospheric pressure is expected to be 
between 1.8 and 2.2 cm?/volt second. 

A glance at the Trichel pulse reveals that the break 
appears some 2500 microseconds after the time of for- 
mation of the pulse. Since the mobility of O. ions in 
air at atmospheric pressure, according to Doehring,” 
is 2.68 cm?/voltXsecond, the break could not be ac- 
counted for by the arrival of O.- ions at the plate if 
these ions are formed in a region only a few millimeters 
from the point. This indicates the presence of ions of 
higher mobilities. Therefore, on the basis of our mech- 
anism, it is logical to asume that the break is indicative 
of the arrival of O~ ions. Such an assumption would 
mean that the O- ions in air have a mobility of about 
4 cm?/volt second. The broad tail following the break 
and gradually approaching zero is explained in the 
following manner. 

According to the mechanism some of the electrons 
which do not undergo dissociative attachment, say 
some 20 percent at 760 mm Hg, form O;- ions by Brad- 
bury-Bloch™ process in the lower field region. Thus, a 
very diffuse cloud of O:- ions is formed in a relatively 
large distance from the point, say, a few millimeters. 
This could account for the long broad tail. Also, the 
self-repulsion and diffusion of a dense cloud of O- ions 
contributes to this broadening appreciably. It must be 
pointed out that if there were present only one type of 
ion (created near the point), the break would be much 
sharper and the tail much shorter, since the time of 
formation of Trichel pulse (10-* second) is about 104 
times shorter than that of the burst pulse. Yet the 
burst pulse gives a comparatively longer plateau and 
shorter tail. It is true that the field is somewhat higher 
for the Trichel pulse than for the burst pulse. However, 
the difference in field would be effective only for motion 
in the high field region, but the small difference would 
not affect appreciably the time of crossing the whole 
gap. 
The following explanation cannot be ignored en- 
tirely. If the O~ ion formed complex ions of lower 
mobility than that of O,* ions in the lower field region, 
then it is possible to account for the break at 2500 
microseconds as being caused by O,~ ions that form # 
the way to the anode and the long tail as being caused 
by the clustered O- ions. Since for choking the pulse, 
O- ions must form near the point to the extent of at 
least 50 percent of the electrons, it is clear that the 
rather sharp break at 2500 microseconds with the 
very gradual tailing off of the pulse is inconsistent with 
the gradual formation of O:- ions in mid-gap which 
would give a very broad first break. 

The writer wishes to express his gratitude to Pro- 
fessor L. B. Loeb, under whose suggestion and direction 
this work was done, for his guidance and valuable 
assistance throughout the work. 


3 Doehring, Z. Naturforsch. 7a, 253 (1952). 
“4 F. Bloch and N. E. Bradbury, Phys. Rev. 38, 689 (1935). 
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A study of bulk minority carrier lifetimes and surface recombination rates has been made, using experi- 
mental equipment similar to that of Navon, Bray, and Fan, wherein the time-decay of carriers injected 
electrically by a pulse is observed. The diffusion theory for carriers of bulk lifetime r has been worked out, 
considering a definite probability of nonrecombination, or reflection coefficient, to be associated with the 
surfaces of the sample. A simple relationship between this quantity and the surface recombination velocity 
has been derived. If the surface recombination velocity is known, the effective trapping area at the surface 
may be calculated from this relation. Values of bulk lifetime and surface recombination velocity have been 
found for both n- and p-type germanium samples, and for a wide variety of surface treatments. By suitable 
modifications in experimental technique, this experiment may be utilized to measure drift mobility of 


minority carriers. 


INTRODUCTION 


N experimental technique developed by Navon> 
Bray, and Fan! has been used in this laboratory 
to make an extensive study of bulk lifetime and surface 
recombination in germanium. It has been found possible 
to utilize diffusion theory appropriate to the sample 
geometry used in the experiments to separate bulk and 
surface effects, and, by slight modifications in experi- 
mental technique, to calculate drift mobility of mi- 
nority carriers from recombination measurements. 
Briefly, the experimental technique involves electrical 
injection of minority carriers into a sample of ger- 
manium in the form of a rectangular parallelepiped by 
the application of a voltage pulse across the sample, 
and observation of the decay of the injected carrier 
density as a function of time thereafter by measuring 
the conductivity of the sample with a very small sweep 
voltage. A schematic diagram of the experimental ap- 
paratus is shown in Fig. 1. 


DIFFUSION AND RECOMBINATION OF 
INJECTED CARRIERS 


Carriers injected into a semiconductor move by the 
usual processes of diffusion, and the injected carrier 
density, that is, the actual carrier density minus the 
thermal equilibrium carrier density, satisfies the 
diffusion equation (1). 


n On 
D(Vv*n)—--=—. (1) 
rt ot 


Here » is the injected carrier density, D is the diffusion 
coefficient for the injected carriers, and r is the average 
lifetime of injected carriers in the bulk of the material. 
It will be assumed that the diffusion equation holds in 
a rectangular solid whose thickness, width, and length 
are given by Xo, yo, and 20, respectively. It will later be 
necessary to assume that x<yo<zo. In this geometry 
we express as n(x,y,z,/), and it is assumed that for 


‘Navon, Bray, and Fan, Proc. Inst. Radio Engrs. 40, 1342 
(1952). 


‘=O there is present a uniform injected carrier density 
no throughout this volume, that is, 


n(x,y,2,0) = no= const. (2) 


It is necessary also to account for the fact that 
trapping and subsequent recombination at the surface 
of the sample occurs more or less independently of the 
bulk recombination. The surface recombination may 
be characterized by the probability of nonrecombina- 
tion for a carrier upon colliding with the surface 
potential barrier. This quantity is termed the surface 
“reflection coefficient” for carriers and denoted by R. 
Figure 2 represents a situation where A carriers per 
unit time are introduced into the surface region. 
Of this number RA are “reflected’”’ back to the bulk. 
Those entering the bulk are subject to various scatter- 
ing and absorption processes, so that a fraction B will 
eventually return to the surface. The carriers are thus 
subject to a “multiple reflection” process, R being the 
reflection coefficient of the surface and B being the 
effective reflection coefficient of the bulk, as shown in 
Fig. 2. Summing carrier flux to. the right and to the 
left, one obtains (3) and (4), respectively. 


- A 
F,= AD RiBi= —— (3) 
i=0 1—RB 
2 AR 
F,= ARDY RiBi=——_. (4) 
i=0 1—RB 


It must be true that if R=1 


x 


n, {m \' f* 
Fi=Fi= f v,P(v,)dv,= (—) J Vz 
0 /t 2kT 0 


kT \3 
Xexp[ — mv2/2kT |dv,=n, (—) , (5) 
2am 





where ® (v,) is the Boltzmann velocity-in-x distribution 
and n, is the injected carrier concentration in the sur- 
face layer. Since the mean thermal velocity is given by 
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@=[(8/x)(kT/m) |!, one may write (5) in the form 


nC 
F,=F.,= a, (6) 
4 


In these equations, m refers to the effective mass of 
carriers in the semiconductor. 

If R<1, there will be a net flow of carriers toward 
the surface, and F; will be greater than F>. In this case, 
if the fractional concentration variation over the carrier 
mean free path is small, one may consider that the 
flux F, which arrives at the surface originates at a 
virtual source situated to the left of the surface (Fig. 
2) a distance aA, where J is the carrier mean free path 
and @ is a numerical constant of the order of unity. 
Similarly, F; may be assumed to originate at a virtual 
source situated a distance ad to the right of the sur- 
face. Then, substituting the value of mn for these 
virtual sources in (6), 








q On\ 1 

Pi n-an(—) a (7) 
a OxJ 44 
4 on\ 1 

F,= ntan(—) . (8) 
| dx/ 44 


Adding (3) and (4), equating the result to the sum of 
(7) and (8), and solving for A, the result is 


A=———-. (9) 


Subtracting (4) from (3) and utilizing (9), one obtains 
(10). 
nf1i—R 
F,— F,= — ——. (10) 
2 1+R 


Here, F,— Fz is the net flow toward the surface, which 
must equal the normal component of the classical 
diffusion flow, —D(U-Vn), at the surface, and we 
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Fic. 1. Schematic block diagram of lifetime measuring apparatus. 
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Fic. 2. Multiple reflection representation of carrier diffusion in 
the neighborhood of a surface. 


shall identify the coefficient of m, in (10) with the 
parameter s, usually referred to as the surface re- 
combination velocity. U denotes an outwardly directed 
unit vector normal to the surface. Thus 


ng f1—R 
F,—F,.=—D(U-Vn),= ~(—) =m (11) 
2 \1+R 


This immediately establishes a further boundary 
condition on (1): 
— D(U-Vn),=n,s (12) 


and also expresses the surface recombination velocity 
in terms of a more fundamental quantity, the prob- 
ability R of nonrecombination for a collision with the 
surface. Using (11), (7), and (8), 


¢1-R F,-—F; adé a) 








s=- = 


21+R nN, 2 Ox 


From (7), (8), and (11), noting that D=)Z@/3, it is an 
easy matter to show that in fact a=. 

It is necessary then to solve the diffusion equation (1) 
subject to the boundary conditions (2) and (12). The 
equation can be solved in a rectangular solid of dimen- 
sions Xo, Yo, 20, but the solution is extremely complicated. 
We shall assume that x is small compared to yo and 20, 
obtain the solution to the diffusion equation in a slab 
of thickness xo extending to infinity in the y and z 
directions, as a first approximation, and draw upon the 
results of the 3-dimensional solution to make correc- 
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Fic. 3. Assumed orientation of sample for diffusion calculations. 


As a first approximation yo and 2 are assumed to be infinite, re- 
sulting in a large slab of material of thickness xo. 
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tions to the first approximation for side and end effects 
arising from the fact that yo and 2» are finite. 

Proceeding in this manner, the boundary value 
problem becomes 


D—--=— (14) 


on 


n(x,0)= 9; D — =N,5. (15) 
‘ Ox), 


The equation may be solved under these boundary 
conditions by the classical methods of separation of 
variables and expansion of the initial carrier density in 
terms of eigenfunctions which satisfy the boundary 
conditions.” If the origin is taken in the center of the 
sample (Fig. 3), the solution of (14) which satisfies the 
boundary conditions (15) is 


« sin(é/2) 
n(x,t) = 4nge—"/* > ———_——_ 
iit a@+sinéa 


Xexp[—2§?Dt/x? |cos(Earx/xo), (16) 


where the set of eigenvalues (£;) is chosen such that the 
eigenvalue &; is the ith positive root of the equation 


ctn(é9/2)— &rD/sxo=0. (17) 


2See for instance W. Shockley, Electrons and Holes in Semi- 
conductors (D. Van Nostrand Company, Inc., New York, 1950), 
p. 319. 
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The actual conductivity change do is easily found 
from (16), since 


Cot xo/2 
bo = =f n(x,t)dx= S8noeque"!* 
Xo Y—z0/2 


oo sin? (gar / 2 ) 





—expl —7§?Dt/xe], (18) 
imt£ a (£ar+siné,r) 


where éo is the electronic charge and yw the drift mobil- 
ity of the minority carriers. (18) represents the time 
decay of the injected carrier density and corresponds 
to the trace observed on the screen of the oscilloscope. 
The surface recombination velocity influences the values 
of the eigenvalues (£;) through (17) and thus influences 
the time decay of (18). 

It is desirable to analyze (18) as a function of time in 
such a way as to be able to infer the surface recombina- 
tion velocity, assuming the bulk lifetime to be known. 
This can be accomplished by observing that whatever 
the value of s may be, the value of £, (and thus the 
value of s) may be obtained by measuring the decay 
constant of (18) after some time has elapsed after in- 
jection. This is true because after a comparatively short 
time the magnitudes of all the terms in the summation 
in (18), other than the first, become negligible com- 
pared to that of the first term. For small values of 
s(s&2D/xo), from (17) it is clear that £<=2i—2, and 
under these conditions, using L’H6pital’s rule to 
evaluate the limits problem which arises when i= 1, it 
can be shown that the amplitude factor of the terms for 
which i>1 is small compared to the amplitude factor 
of the first term. Also, the time constants of the higher 
terms are smaller and thus after a short time these 
terms become very small indeed. For large values of 
s(s>>2D/xo), (17) shows that £<+2i—1. The time con- 
stant of the second term is only about § that of the 
first, and the amplitude factor-of the second term is 
also about § that of the first, so again after a short 
time this term can be neglected. The third and higher 
terms are even smaller. For intermediate values of s, 
both of these arguments apply to a certain extent, the 
result being essentially the same; one may measure £; 
by measuring the time constant of the “tail” of the 
function (18). 

Neglecting terms in the summation in (18) other 
than the first, and denoting the over-all observed 
decay constant as to, we may write 


1 1 é&'rD 
—=—+——. (19) 
: 


To Xo" 





Solving this equation for £, substituting in (17), and 
solving for s, the result is 
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This equation has been derived under the assump- 
tion of surface recombination in an infinite slab of 
thickness xo. To deal rigorously with recombination in 
a rectangular solid, it is necessary to solve a 3-dimen- 
sional boundary value problem (1) using boundary 
conditions (2) and (12). It is possible to avoid the 
complication involved in using directly the results of 
this solution by incorporating the effects of recombina- 
tion on the side and end surfaces of the sample into a 
corrected sample thickness, or “virtual thickness,” 
¥o(< xo). This corrected thickness Zo is defined as the 
thickness a sample of infinite extent in the y and z 
directions would have to have to produce the same ob- 
served decay constant 7», which the actual sample 
under consideration, with finite dimensions xo, Yo, 20, 
exhibits. Such a treatment is an approximation and 
demands that x» be small compared to yo and 29. (See 
Fig. 3.) In actual practice it turns out that xo~yo/3 is 
satisfactory. 

The solution of the diffusion equation in a 3-dimen- 
sional rectangular solid under the boundary conditions 
(2) and (12) is a triple summation of products of func- 
tions of the same form as those shown in the summation 
in (18). These functions involve three sets of eigen- 
values, (£;, 7;, ¢,) for the (i,j,k) term of the triple sum- 
mation corresponding to the x,y, and z directions, re- 
spectively. It can be shown in a manner similar to that 
used in the first approximation that the time constant 
of the “tail” of the da vs / curve obtained from the 
rigorous solution of the diffusion equation is essentially 
that of the (1,1,1) term of the triple summation, the 
effects of the higher terms becoming small after a 
comparatively short time following injection. The 
analogs of Eqs. (19) and (17), respectively, are 


1 1 é? nr ite 
—=-+2:(=+" 4) (21) 


9 9 9 
To T Xo Vo 20 
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Fic. 4. Graphical representation of the roots of an equation of 
the form of (22) for large and small values of s. 


and 





ctn(&r/2) = £rD/sxo 
ctn(qr/2)=rD/syop. (22) 
ctn(¢yr/2)=fi9D/ szo 


In this case £1, m, and ¢, cannot be eliminated between 
the four equations (21) and (22) to give an explicit 
formula for s, analogous to (20). Nevertheless it is 
possible to calculate s by making suitable approxima- 
tions and introducing the virtual thickness Zp. 

Suppose that s is small (s«2D/zo). Then, in (22) the 
cotangents will be much larger than their arguments, 
which implies that the eigenvalues (£1, m1, £1) will be 
small compared to unity. This situation is shown 
graphically in Fig. 4. In this case, one may assume in 
(22) that ctn x1/x—<x/3 and eliminate &, m, ¢ be- 
tween the four equations. The result is 


1 1 1 1 
_ 12sp| , + ~_— 
xo(6D+sx0) yo(6D+syo) 





> eR (23) 
29(6D+ szo) 


which reduces to 
1 1 >. 2 & 
ae 
To T Xo Yo 20 


if sxo, Syo, and szo are neglected compared to 6D. We 
shall write the above equation 


ft is 
——-=—(1+a+8), (24) 
To T NX 
where 
Xo Xo 
a=— B=—. (25) 
Yo 30 


In the first approximation the analog of this equation 
is (19). In (17) if s&2D/x, ctn x&1/x and §&2r* 
= 2sxo/D, whence (19) can be written 


1 1 2s 
——-=—., (26) 


To T Xo 


Comparing (24) and (26), it is obvious that if r is 
identical for the actual finite sample and for a hypo- 
thetical one-dimensional sample of thickness #, then 


x 
Fo= —_——_- (s{&2D/z9). (27) 
i+a+, 


If, on the other hand, s>>2D/a, then the eigen- 


values (&, m1, £1) are all nearly unity (see Fig. 4). In 
this case one can write (21) 


1 1 1 wD 
~--="D(—+—+— =— (1+a?+8'). 


Xe yor Zo" Xo 





638 5. Rs 


Since the simple one-dimensional treatment gives 
(1/79) — (1/7r)=2°D/x? from (19), it is obvious that 
in this case 
Xo 
2.32 ————_—_—_——_ (s>>2D/x»). (28) 
(1+a?+ 8")! 


To summarize the preceding arguments, if s<2D/20, 
a virtual thickness Zo= xo/(1+a+ 8) is used to replace 
xo in (20) in order to calculate s; if s>2D/xo a virtual 
thickness %o= x/(1+-a?+ 6")! is used. Since in any prac- 
tical experimental case a1, 81, these corrections 
are not very large. If s lies between these two values, 
a correction intermediate in value between the two 
cases discussed above, arrived at by some manner of 
averaging, may be used. Note that for both corrections 
oxo as a and 8 approach zero. Some slight fore- 
knowledge of the order of magnitude of s is required 
to know which correction to apply. This is usually 
obvious from the relative magnitude of r and the ex- 
perimentally observed ro, and in any case can be calcu- 
lated with sufficient accuracy to determine which cor- 
rection is applicable by arbitrarily choosing one of the 
corrections (27) or (28) and substituting in (20). The 
values of Z given by (27) and (28) are always close 
enough together to indicate unambiguously which cor- 
rection should be applied. 

To apply (20) to determine surface recombination 
velocity, in addition to the experimentally measured 
decay constant 7», it is necessary to know the bulk 
lifetime r and the diffusion constant D for minority 
carriers. These two quantities may be determined ex- 
perimentally by making minor changes in technique 
which are described below. 


EXPERIMENTAL TECHNIQUES 


A block diagram of the experimental equipment is 
shown in Fig. 1. A pulse generator produces a sharp 
pulse of 5 usec duration and 300-volts amplitude across 
the germanium sample. Type 4C35 hydrogen thyratrons 
are utilized in the pulse generator to produce an output 
impedance of the order of 100 ohms. The pulse produced 
is of sufficient amplitude and duration to inject minority 
carriers and to sweep them uniformly throughout the 
sample. The pulse repetition rate is 60 sec~'. The con- 
ductivity of the sample is measured subsequent to the 
pulse by measuring the voltage across a resistor in 
series with the germanium and a current source, and 
the measuring current is in most cases kept sufficiently 
small so that the proportion of carriers swept out of 
the sample by the conductivity measuring field to the 
total number of injected carriers is small. For samples 
of quite long lifetime, however, it is often necessary to 
apply a correction to account for carriers swept out 
of the sample in this manner. This correction will be 
discussed further below. The 1-V diode is inserted in 
the circuit to prevent the full pulse voltage from being 
applied to the input of the amplifier. The indicating 
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instrument is a Dumont type 256-D oscilloscope which 
is equipped with triggered sweeps of various lengths 
and with calibrated time-base markers. The sweep is 
initiated by the injecting pulse. In practice, the time 
constants are arrived at by measuring photographs of 
the oscilloscope traces. It is felt that time constants 
can be measured in this manner with about +5 percent 
accuracy. 

Measurements of bulk lifetime are made using sam- 
ples cut directly from zone-melted single crystal ingots 
and surface-treated to reduce surface recombination to 
an absolute minimum. The very large size of these 
samples (10X10X20 mm) and- consequent small sur- 
face area-to-volume ratio, in conjunction with the low 
surface-recombination etch which is used, enable one 
to observe directly the bulk lifetime +r as the decay 
constant of the observed lifetime trace. Naturally, for 
samples of very long bulk lifetime it is not possible 
to neglect surface effects in this manner, but for the 
samples used in these investigations (r~200 usec) the 
discrepancy between the observed time constant and 
the actual bulk lifetime is well within the uncertainty 
of the measurements. This can be seen at once from (12). 

Surface recombination velocities were determined for 
a large number of surface treatments by cutting small 
samples out of zone-melted single crystal ingots whose 
bulk lifetime had been measured by the method de- 
scribed above. The samples employed for this purpose 
were about 1 mm thick, 3 to 5 mm wide, and 10-15 
mm long. The observed decay constant for injected 
carriers was obtained and s calculated using (20) with 
the appropriate virtual thickness %) obtained from (27) 
or (28). In the case of n-type samples good injection 
usually cannot be obtained from the pressure-type 
contacts used in this experiment unless the contact 
ends of the sample are metal-plated. In these experi- 
ments copper-plating has been employed, but nickel- 
plating will do as well. Soldered contacts may be used 
but are more difficult to prepare and necessitate heating 
the sample, which may be undesirable. Care must, of 
course, be taken to plate only the ends of the sample, 
as contact of the plating solutions or plated film with 
the etched sample surfaces may produce inaccurate 
determinations of surface recombination velocity. If 
the plating is confined to the end of the sample, the 
effect of this treatment on the surface recombination 
velocity may be neglected, since carriers are swept 
away from one plated contact by the conductivity 
measuring field, and their disappearance at the other 
contact is corrected for, if necessary, by the normal 
correction for sweeping effect discussed below. Also, 
the plated area is small compared to the total surface 
area of the sample. In the case of p-type samples, how- 
ever, electron injection is unsatisfactory from copper- 
plated contacts. Bare metal-to-semiconductor contacts 
are sufficient to effect satisfactory electron injection 
into p-type germanium. This type of contact on n-type 
samples, however, causes very unsteady or “noisy” 











oo tm {P SHY tt "SS ™ 


— ¢ 


— 


eee SED 


—— 





VOLUME AND SURFACE RECOMBINATION RATES 


injection. It has been possible, in spite of these diffi- 
culties, to make some observations on samples with 
end contacts unplated, and then to plate the ends of 
the samples as described above and measure the decay 
constants again. It is observed that plating the end 
contacts does not measurably affect the observed time 
constants. 

The preparation of a sample for surface recombina- 
tion velocity measurements involves a number of opera- 
tions. The sample is lapped with 800 grit alundum 
lapping compound on all six sides to remove all traces 
of previous surface treatment and incidental surface 
contamination. Then the surface is chemically etched 
and rinsed in water, nitric acid if the etch contains 
heavy metal ions, and methyl alcohol. The ends of the 
sample are then plated (m-type samples only) and the 
sample is ready for measurement. 

In order to use (20) to calculate surface recombina- 
tion velocity, it is necessary to know the value of D, 
the diffusion constant for minority carriers. This value 
is commonly assumed to be 92 cm? sec~ for electrons 
and 44 cm? sec for holes. However, since the diffusion 
constant is connected with the drift mobility u through 
the Einstein relation, 

pkT 
D=—, (29) 


C0 


and since it is observed experimentally that drift mo- 
bility of minority carriers varies somewhat from one 
sample of germanium to another, it is desirable to 
measure D for the actual samples at hand. This can 
be done by lapping the surfaces of the sample and 
observing the decay constant exactly as for the surface 
recombination velocity measurement. For such sur- 
faces, s is very large and the argument of the tangent 
function in (20) is very nearly equal to 7/2. Setting this 
argument equal to w/2 and solving for D gives 


Z?s1 1 
ni) 
mXr’ 3+ 


where 79 is the observed time constant for lapped 
surfaces, and Z is of course given by (28). The results 
of some actual measurements carried out in this manner 
are tabulated in the following section. The Einstein 
relationship between drift mobility and diffusion con- 
stant has been experimentally verified recently at the 
Bell Telephone Laboratories.’ It should be noted that 
the diffusion constant obtained from these measure- 
ments is the ambipolar diffusion constant. If the sample 
is in the near-intrinsic conductivity range the ambipolar 
diffusion constant may differ considerably from the 
diffusion constant for minority carriers in the extrinsic 
conductivity range.‘ 

If the observed lifetime is quite long, or the applied 

* Transistor Teachers Summer School, BTL, Phys. Rev. 88, 


1368 (1952). 
*W. van Roosbroeck, Phys. Rev. 91, 282 (1953). 


(30) 
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sweeping voltage high, the effect of carriers swept out 
of the sample by the conductivity measuring field can 
no longer be ignored, and a correction to the observed 
lifetime must be made. To account rigorously for the 
effect of this field, a term —pE)dn/dz should be added 
to the left-hand side of (1) and the resulting equation 
solved using (2) and (12). Since the solution obtained 
without this term is itself very cumbersome, it is desir- 
able to avoid the additional complication introduced by 
adding the field term, and instead derive a correction 
to the solution of (1) for the field term which is appli- 
cable for small values of the applied conductivity meas- 
uring field Eo, and which will account for the fact that 
carriers are removed from the sample at one of the end 
contacts. Since the sweeping field is applied along the 
z axis, the number of carriers left in the sample at 
time ¢ will be proportional to (zo—vt)e~‘/*, where v is 
the drift velocity of the carriers under the influence of 
the sweeping field. If this drift velocity is expressed in 
terms of the drift mobility and applied voltage Vo, 
and the factor (zo— vt) approximated by an exponential, 
it can be shown that if uVo7;/20?<1, then 


1 1 pVo 
antioaeeots (31) 


To Ti 20° 


where 7 is the zero-field time constant and 7; is the 
time constant observed when the conductivity field 
has the value Eo= V0/2o. 


EXPERIMENTAL RESULTS 


Results of measurements of surface recombination 
velocity for a variety of surface treatments on n-type 
and p-type samples are given in Table I. The details 
of these surface treatments are listed below. 


1. WAg (Westinghouse Silver Etch): 4 cc HF, 2 cc 
HNO;, 4 cc H,O containing 200 mg AgNO. Allow to 
stand for } hr before use. Normal etch time 1-2 min. 
Preferential to (111) surfaces. Gives rough surface with 
bright (111) reflections. 

2. H2O2 (Superoxol etch): 1 cc HF, 1 cc H2O2 (30 
percent), 4 cc H,O. Allow to stand 3 hour before use. 
Preferential to (100) surfaces. Normal etching time 
1-2 min. Gives fairly smooth but nonspecular surface. 

3. CP4: 15 cc CH;COOH, 25 cc HNO, 15 cc HF, 
0.3 cc Bre. Allow to stand $ hr before use. Nonprefer- 
ential etch, gives bright specular surface. Normal etch- 
ing time, 1 min. 

4. Boiling water: Boil sample 5 min in distilled water. 
Small weight loss, dull surface. 

5. Cu etch: 4 cc HF, 2 cc HNO;, 4 cc H,O containing 
200 mg Cu(NOs;)2. Allow to stand $ hr before use. Non- 
preferential etch, gives bright specular surface. Normal 
etching time 1-2 min. 

6. CP4-A: Same as CP4 above but without Bro. 

7. Pd etch: 4 cc HF, 2 cc HNO;, 1 cc PdCl: solution 
containing 5 mg PdCl,. Allow to stand $ hr before use. 
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TaBLe I. Surface recombination velocities (cm/sec). 











n-type p-type 

Etch s (range) s(mean) s(range) s(mean) 
WAg (2 min) 15- 77 31(+18,—11) 53-— 330 160(+130,—70) 
H,0; (2 min) 18- 96 53(+33,—20) 
CP4 (1 min) 50— 220 100 
Boiling water >2X 104 
Cu (2 min) 210-1000 450 800-5000 2800 
CP4-A (2 min) 260-1240 490 
NaOH elec. (2 min) 120— 960 290 
Pd (1 min) 630- 780 660 80- 430 250 
§:5:1 (1 min) 1600-2800 2100 
Ni elec. (2 min) 250- 670 380 
Chromic acid elec. (2 min) >2xX10* 
H;PO; elec. (2 min) ~10 








Normal etching time, 1 min. Gives bright specular 
surface. 

8. NaOH electrolytic: Ge sample anode, Ge cathode, 
10 percent NaOH solution electrolyte. Current density, 
160 ma/cm?. Normal etching time, 1 min. 

9. 5:5:1 etch: 5 cc HNO;, 5 cc HF, 1 cc HO. Allow 
to stand 4 hr before use. Normal etching time, 1 min. 

10. Ni electrolytic: Ge sample anode, Ni wire cath- 
ode, Ni plating solution (NiCl.-6H,0, 300 g/l; H;BOs, 
30 g/l; HCl to make pH=1) electrolyte. Current 
density, 230 ma/cm*. Normal etching time, 2 min. 

11. Chromic acid electrolytic: Ge sample anode, Ge 
cathode, electrolyte, 2.5 g CrO3, 5 cc HNOs, 20 cc H,0. 
Allow to stand 5 minutes before use. Current density, 
10 ma/cm?. Normal etching time, 2 min. 
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Fic. 5. Individual measurements of surface recombination 
velocity vs surface treatment. 


12. H;PO; electrolytic: Ge sample anode, Ge cath- 
ode, 10 percent H;POg, electrolyte. Current density, 
10 ma/cm*. Normal etching time, 2 min. 


The scatter of individual measurements of surface 
recombination velocity is shown in Fig. 5. For any given 


surface treatment there is usually a rather large spread 


of experimental surface recombination velocities meas- 
ured with etching solutions of the same composition 
made up on different occasions and with different sam- 
ples, sometimes as much as a factor of 2 above and 
below the mean value. This rather poor reproducibility 
is characteristic of surface phenomena of this type, 
since it is nearly impossible to reproduce exactly surface 
conditions from one trial to the next. These difficulties 
arise for several reasons. For instance, the etching agent 
requires an “induction period” to initiate its action, 
which varies from sample to sample. Etching action is 
usually initiated at small surface imperfections such 
as pits or scratches, and thus the microscopic condition 
of the surface, which it is impossible to reproduce 
exactly from one occasion to the next, plays an im- 
portant réle in the activity of the etching process. 
Most of the etching solutions used in this investiga- 
tion are dependent upon catalyzing agents such as 
heavy metal ions or halogen molecules for their activity ; 
the actual mechanisms of the etching reactions them- 
selves, however, are not well understood. For this 
reason it is impossible to say exactly what chemical 
conditions are necessary for reproducible etching, but 
it is observed that etch activity will vary from one 
batch of supposedly identical etching solution to an- 
other, thus adversely affecting reproducibility of sur- 
face conditions. In addition, small local variations in 
bulk lifetime from sample to sample may cause a scatter 
in calculated recombination velocities, which, however, 
should exhibit consistent errors on the same sample. 
The action of some etches is preferential with respect 
to certain crystal planes. This leads one to expect that 
surface recombination velocity might vary as a function 
of crystal orientation for certain types of surface treat- 
ment. Experimentally, however, no correlation could 
be found between measured values of surface recom- 
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VOLUME AND SURFACE RECOMBINATION RATES 


bination velocity and crystal orientation on m-type 
germanium samples oriented with (100), (110), and 
(111) planes in the large yz surface of the sample when 
etched with the (111) preferential WAg etch.® 

It is observed that reproducibility is better for some 
surface treatments than for others. 22 measurements of 
surface recombination velocity using a 2-minute WAg 
etch on n-type samples gave a set of data whose rms 
deviation from the mean was less than +50 percent. 
This may be considered good reproducibility in this 
type of experiment. The WAg etch, in addition to pro- 
viding very low surface recombination velocity, was 
felt to be superior to other low-recombination etches 
such as CP-4 and “Superoxol’’ in respect to reproduci- 
bility of surface conditions. In fact, one may utilize 
this good reproducibility observed for WAg etch to 
find bulk carrier lifetime-directly from the decay con- 
stant to measured on small samples by assuming that 
s=30 cm/sec for n-type or 160 cm/sec for p-type 
samples and solving for 7 in (23). If the difference 
between 7o and 7 is fairly small compared to 7, one 
may obtain a fairly accurate estimate of bulk lifetime 
in this manner. One may assume that D=90 cm?/sec 
for electrons and 45 cm?/sec for holes in this calculation 
without introducing much additional error. 

Note that in general surface recombination velocities 
are not the same for m-type and p-type samples using 
the same surface treatment. In the case of WAg etched 
surfaces, the values s= 30 cm/sec for n-type and s= 160 
cm/sec for p-type material are in good agreement with 
the values obtained by Bardeen and Brattain® for a 
CP4 etched surface. On the other hand, in the case of 
the Palladium etch (No. 7 above) the values of s were 
lower for p-type samples than for n-type. 

Values of the reflection probability R can be obtained 
frem the experimental values of s by (13). This relation 
is shown graphically in Fig. 6 for T=300°K (¢~10' 
cm/sec). 1—R represents the probability of recombina- 
tion for collisions with the surface, and therefore can 
be construed as the effective surface trapping area per 
unit surface area. From this plot it is apparent that the 
surface recombination probability ranges from values 
of the order of 10? for poor surface treatments and 
lapped surfaces to about 10~° for low surface recombina- 
tion treatments such as WAg, CP4, etc. It is not un- 


TABLE IT. Drift mobility and bulk lifetime. 











Ingot Type p, ohm-cm D, cm?/sec pu cm?/volt-sec 7, usec 

Z- 24 n 11 53.4+6.8 2060+ 260 150+10 
Z-103 n 9 40.0+2.6 1550+100 250+15 
Z-105 n 11 45.9+5.7 1770+200 180+10 
Z-100 p 20 78.6+6.1 3050+ 240 180+10 








5R. H. Wynne and C. Goldberg, J. Metals 5, 436 (1953). 
— and W. H. Brattain, Bell System Tech. J. 32, 1 
1953). 
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Fic. 6. Probability 1— R of recombination on collision with a 
surface vs surface recombination velocity. 


reasonable to suppose that the relatively large recom- 
bination probabilities associated with lapped surfaces 
may be explained by trapping of carriers and subsequent 
recombination in the lattice distortions and imper- 
fections at the surface which are created by this treat- 
ment. It is found that when a lapped germanium surface 
is etched with WAg etchant, an amount of germanium 
corresponding to a layer 5u thick (~10* lattice con- 
stants in depth) must be removed to bring s down to 
the order of magnitude associated with the etched 
surface. These data provide some idea of the extent 
of the lattice disturbances mentioned above. 

Table II gives measured values of bulk lifetime, dif- 
fusion constant, and drift mobility of minority carriers 
for the ingots of germanium from which the samples 
used in the surface recombination measurements were 
cut. The values of drift mobility listed for these ingots 
are in reasonably good agreement with the values of 
Haynes and Shockley’ which were measured by an 
entirely different method. For the samples listed in 
Table II the measured ambipolar diffusion constant is 
within about 2 percent of the actual extrinsic minority 
carrier diffusion constant.** 

The authors wish to express their appreciation to Dr. 
J. W. Faust for his help in formulating surface treat- 
ments and etching the surfaces of the samples used in 
the experiments, and to Dr. C. W. Helstrom for helpful 
discussions concerning the diffusion problems encoun- 
tered in the interpretation of the experimental] data. 


7J. R. Haynes and W. Shockley, Phys. Rev. 81, 835 (1951). 

* Note added in proof —Equation (18) represents strictly only 
the conductivity change due to injected minority carriers. Since 
simultaneous injection of majority carriers must take place in 
order to maintain over-all charge neutrality in the crystal, (18) 
should be multiplied by a factor (+1) to give the correct conduc- 
tivity change, where is the ratio of electron to hole drift mobility. 
Only the magnitude of (18) is affected by this factor, the time 
dependence, which is the only part of the expression used in the 
subsequent arguments, being unaffected. 
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Microwaves propagated between two doubly-curved parallel conducting surfaces, in the geometrical 
optics approximation, follow rays obeying Fermat’s principle. The optical distance in this case is determined 
both by the index of refraction of the material between the surfaces and by the curvature of the surfaces. 
The focusing properties of such a parallel plate lens is fully specified by giving the metric of the two-dimen- 
sional optical space associated with the mean surface of the guide. A family of equivalent lenses is shown to 
exist, and formulas are obtained that permit one to go from any one of these lenses to any other. Furthermore, 
conditions are given for there to exist in a family of equivalent lenses a lens employing a unit index of re- 
fraction everywhere. Of particular interest is the demonstration that the Rinehart analog of ‘the Luneberg 
lens and the RCA “R-2R” lens are members of a two-parameter family of lenses with analogous focusing 


properties. 





1, INTRODUCTION 


NE of the techniques used in microwave antenna 
theory to achieve a certain desired focusing of an 
electromagnetic wave is to allow this wave to propagate 
between two closely-spaced conducting plates. Although 
other modes may be used, one usually makes the spacing 
between the plates small enough so that only the trans- 
verse electric and magnetic mode, the so-called TEM 
mode, is propagated. In this mode the velocity of 
propagation is independent of the spacing and equal 
to the velocity in free space, but one may achieve a 
focusing of the electromagnetic wave by using doubly- 
curved plates or joined sections of flat or singly-curved 
plates. The focusing depends on the relative path lengths 
along the various rays. This technique has been used 
chiefly in the design of antenna feeds such as those 
mentioned in references 7 and 9 (see Bibliography at 
the end of the paper). 

The purpose of this paper is to develop the mathe- 
matical theory pertaining to the design of two-dimen- 
sional path-length fenses. This includes lenses in which 
the space between the plates is filled with a substance 
whose index of refraction varies from point to point. 
Disregarding the limitations imposed by practical 
considerations, we shall assume that the index of re- 
fraction n may be given by any positive function of the 
coordinates. The treatment will be confined to a geo- 
metrical optics approximation in which one determines 
the paths of the rays on a mean surface. It is clear that 
such a two-dimensional treatment demands in general 
that the spacing between the conducting surfaces be 
small compared to a wavelength and to the principal 
radii of curvature of the conducting surfaces; however, 
no attempt is here made to determine the range of 
applicability of this geometrical optics treatment. 

This paper is related to the work of Myers, Levy, and 


* Through sponsorship of the Electronics Research Division, 
Air Force Cambridge Research Center, Cambridge, Massachusetts. 

+t Now at the Research Laboratory, Schlumberger Well Sur- 
veying Corporation, Ridgefield, Connecticut. 


Rinehart as given in the set of references at the end and 
represents an extension and generalization of their 
results. 


2. TEM MODE OF PROPAGATION IN 
TWO-DIMENSIONAL GUIDES 


As stated before, the conducting surfaces forming the 
guide are assumed to be parallel, i.e., they have common 
normals at all points.{ Moreover, for theoretical pur- 
poses we assume that the spacing is so small that there 
is little difference between the curvatures of the two 
conducting surfaces, and one may reduce the problem 
to one of two dimensions. One defines a mean surface, 
which is the locus of the midpoints of the common 
normals, and treats the propagation within the guide in 
terms of rays drawn on this mean surface. 

Usually we shall describe a guide only in terms of the 
shape of its mean surface and the index of refraction n, 
the latter being expressed as a function of curvilinear 
coordinates on this surface. To obtain an actual physical 
guide from this mathematical surface, one needs to con- 
struct the two physical surfaces. These surfaces are the 
locus of points lying a fixed distance from the mean 
surface. For example, in the Rinehart-Luneberg guide 
described later and designed for 3.2-centimeter opera- 
tion this distance from the mean surface was taken as 
one-quarter of an inch. 


3. COORDINATE SYSTEM FOR A SURFACE 
OF REVOLUTION 


For the present we shall assume that all guides con- 
sidered, as well as their indexes of refraction, are axially 
symmetric. Their mean surfaces are then surfaces of 
revolution, and » is a function only of the distance from 
the axis. Thus the mean surface is generated by rotating 
a curve C, called the generating curve, about some line 


t This requirement is made merely for the sake of preciseness, 
no harm would apparently be done by using a slightly different 
relationship between the surfaces. For instance, one might prefer 
to take for the conducting surface two neighboring surfaces of a 
one-parameter family of surfaces. 
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which we may take as the z axis of a cylindrical coor- 
dinate system, see Fig. 1. The other two cylindrical 
coordinates are p and @. 

Instead of describing C and hence the surface S$ in 
terms of the function 2(p), it is convenient to use the 
functional relationship between p and s, where s is the 
distance from the axis, or other suitable point, measured 
along C. Thus one may specify the surface by giving 
p(s) or by giving s(p). An important advantage of using 
the function p(s) is that p is always a single-valued func- 
tion of s. In contrast, s(p) may readily be multiple 
valued. Even so, for many problems it is easier to use 
5(p). 

One may use as the surface coordinates either s and @ 
or, on occasions, p and @. In any case the metric of the 
surface, which gives the distance dL between neighbor- 
ing points, is expressed by 


dL2=ds?+ p'dé. (3.1) 


If one multiplies this distance dL by n, the index of 
refraction, one obtains, by definition, the optical distance 
dé between neighboring points (for the TEM mode of 
propagation) ; thus 

di? = n*[ds?+ p'd }. (3.2) 


If s and 6 are taken as the independent variables, Eq. 
(3.2) may be written out more explicitly as 


di?= n(s)*['ds?+-p(s)*dé"], (3.3) 
while if p and @ are used, it may be written 
di?= N(p)(s’ (p)*dp?+p*d@* ], (3.4) 
where V(p)=n(s). 


4. FERMAT’S PRINCIPLE GENERALIZED 


The whole of geometrical optics may be summarized 
by Fermat’s principle which states that the actual path 
of a ray between two points A and B is such as to make 
the variation of the optical path length zero. Usually 


z 
7 


s= arc length along © 


C= generating curve 





axis of revolution 








-p 


Fic. 1. Generating curve of the mean surface of a guide. 


this means that the ray takes the path of minimum 
optical length. This principle accounts for the laws of 
reflection and refraction in space. It may also, of course, 
be specialized to two-dimensional optical problems 
described by rays in a plane. 

We shall generalize the principle to the extent of 
assuming that it holds for the rays on the curved surface 
representing the mean surface of a two-dimensional 
guide. Thus the path of a ray between two points A and 
B on the mean surface S is such as to make 


B B 
of ar=o f ndL=0. (4.1) 
A A 


If the index is constant everywhere, 


B 
5 f dL=0, (4.2) 
A 


and thus the rays are geodesics on the surface.§ 


5. EQUIVALENT GUIDES 


If the optical metrics [see Eq. (3.2) ] for two different 
guides may be set equal by assuming the proper func- 
tional relation 

s'= f(s,0), 0’ =g(s,0) (5.1) 


between the coordinates on their mean surfaces, one 
may refer to the two as equivalent guides. These func- 
tional relations between the coordinates (s,#) and 
(s’,0’) provide one with a point P’ on the mean surface 
S’ of one guide corresponding to each point P on the 
mean surface S of the other. One has thus a point-by- 
point mapping of the S on S’. This mapping preserves 
the optical distances between neighboring points as 
given by the metric in (3.2), and hence the line integrals 
of the optical distance along equivalent paths, i.e., along 
paths joining equivalent points, are equal. 

Since optical distances along corresponding paths on 
S and S’ are equal, it is clear by Eq. (4.1) that the rays 
describing the propagation of the electromagnetic field 
will pass through equivalent points, the paths being in 
each case extremal (usually minimum) optical paths. 
That is, the mapping functions (5.1) may be applied 
directly to the rays. The rays will thus exhibit the same 
topological relationship to one another in equivalent 
guides. One may summarize these statements by saying 
that the optical goemetry is the same for all equivalent 
guides. 


6. EQUIVALENCE FOR AXIALLY SYMMETRICAL 
GUIDES 


Since we are considering for the present only guides 
having complete axial symmetry, the optical metrics are 
of the form of Eq. (3.3), or Eq. (3.4). If we use bars over 
quantities to indicate those relating to a particular 


§ For experimental confirmation of this theory, see references 
2 and 3. 
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guide and unbarred quantities for those of any other 
equivalent guide, then the equivalence of the optical 
metrics, by Eq. (3.2), requires that 


n® (ds?-+- p°d&*) = ni? (d8°+-p'd&). (6.1) 
One may achieve this result by requiring that 
nds=nds, np=nmp, 0=8. (6.2) 


This leads to the following set of integral relationships, 


[ras [vs =A (8) (6.3a) 
np= nip = B(8) (6.3b) 
ds/p= | ds/p=C(8), (6.3c) 

Jaton f 


where a and 4d, b, and 6 represent values of s and & 
corresponding to equivalent points in the two guides. 

Now 7:(8) and (8) are known, since they correspond 
to the given guide. If one chooses any other mean 
surface described by p(s), ome may, in principle at least, 
determine the index of refraction n(s) needed to make 
the guides equivalent as follows. One obtains the 
mapping function §= f(s) by solving Eq. (6.3c) for 
§. From Equation (6.3b), therefore, 


n(s)= B(f(s))/e(s). (6.4) 


On the other hand, if 2(s) is specified, one obtains f(s) 
from the first equation and then p(s) from Eq. (6.3b). 
Since for any actual surface, see Sec. 12, |dp/ds| <1, 
the p(s) obtained above need not represent a realizable 
surface. 


7. FAMILY OF GUIDES EQUIVALENT TO 
THE LUNEBERG LENS 


A flat guide with index of refraction given in terms of 
the radius r=8 by 
(2—r*)!, for r<l 
fi(r)= 7 (7.1) 
, for r21 


has the property that rays from a source on the unit 
circle r=1 will be focused into a parallel beam in the 
region r2 1. For practical considerations one eliminates 
the portion of the guide outside the unit circle letting 
the electromagnetic field spread normal to the plane. 
This does not seriously influence the beam in the plane 
of the surface. The flat guide within the unit circle with 
index (2—r’)! is referred to as the Luneberg lens.'! The 
restriction of radius to unit length is no real restriction 
since one may use any arbitrary length as the unit of 
length. 

If one finds a curved plate guide whose outer edge is a 


'! This is the two-dimensional equivalent of the perfect optical 
lens discussed by R. K. Luneberg, see reference 6. 
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circle of unit radius that has the same optical geometry 
as the Luneberg lens, one may substitute this new 
guide for the Luneberg lens in the complete guide of 
(7.1). What is more, the new guide will exhibit the same 
focusing property as the Luneberg lens when the outer 
guide is omitted and one operates the lenses in free 
space. 

One may obtain such an equivalent guide corres- 
ponding to any arbitrary shape described by p(s) by 
determining the proper m(s). Since for a flat guide 
§=p=r, Eq. (6.3c) becomes 


f as/o= [ dr/r=\nr. (7.2) 
b 1 
Thus the mapping function is 

r=s@)=ex f ds/p, (7.3) 


b 


where s=b corresponds to r= 1. Since by Eqs. (6.3b) and 
(7.1), B(8)=r(2—r°’)!, one has by Eq. (6.4) 
n(s)= f(s)[2— f(s)? ]}!/p(s). (7.4) 


If one employs p as the independent variable in place of 
s, then 


F (p)=exp f Ls’ (p)/p jdp (7.5) 


replaces f(s) in Eq. (7.4), and N (p) is written in place of 
n(s). 

On the other hand, if one specifies V (p), the required 
shape for equivalence to the Luneberg lens is obtained 
as follows. Solving Eq. (6.3b), which is now 


pV (p)=r(2—Pr*)3, (7.6) 


for r, keeping in mind that 0@<r<1, one obtains the 
mapping function 


r=F(p)=(1—(1—p*N*)*}*. (7.7) 


Next differentiating Eq. (7.1) with respect to p, one finds 
that 


s’(p)/p= (dr/dp)r™. (7.8) 
From Eq. (7.7) therefore 
s'(p)=3(1+ (1—p?°N?)!][1+pN'/N ]. (7.9) 


The integral of this is the required shape function s(p). 
Example 1—Determine the surface having V(p)=1 
which is equivalent to the Luneberg lens. 
From Eq. (7.9) 


)=3f [1+ (1—p?)-*]dp=3(p+sin"p), (7.10) 


the constant of integration being determined such that 
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TABLE 1. Lenses theoretically equivalent to the Luneberg lens. 

















Surface Generating curve Index needed [n(s) or N(p)] Permissible range 
1. Plane S=p [2—p,?}* 0<p<i1 
(Luneberg 
lens itself) 
2. Cone s=kp p*[2—p*}} O<p<1i1 if k>0 
; p>1 if k<0 
3. Cylinder p=1 e~*[2—e*}} s>0 
4. Rinehart s=}(p+sin~"p) 1 0<p<il 
— 
5. Generalized iia —_ {2[1+ (1—p?)? P*—p?} 
Rinehart s=(1—8)p-+h sin™p [1+(—p*)) Pe OSeS1 
surface 
— p*) B— 4 
6. Hemisphere s=sin“'p (201+ (1 P= 6%} O<p<il 


Li+(i—p*)* F 








s=0 on the axis. This is the mean surface of the Rinehart 
analog of the Luneberg lens. { 

By Eq. (7.7) the mapping function giving the corres- 
pondence between points in the Luneberg lens and the 
Rinehart analog is 


r=[1—(1—p?)!}}. (7.11) 


Thus, in particular, r=0 for p=0, and r=1 for p=1. 


8. RELATIONSHIP BETWEEN N(o) AND s(o0) 
AT THE EDGE 


A rather striking relationship exists for the above 
family of guides between the index of refraction N (p) 
and the slope of the generating curve, as described by 
S(p), at p=1. 

From Eq. (7.8) 


14+ (1—p2N2)! N+pN’ 
2N (1—p2N2)! 


s'(p)= 





and since by Eq. (7.5) (recalling that r=1 when p=1) 


N(i)=1, (8.1) 
one has 
s‘(1)=3K, (8.2) 
where 
N+pN’ 
K=lim —————- (8.3) 


1 (1—p2N2) , 


Since the denominator in Eq. (8.3) vanishes, K and 
hence s’(1) is infinite unless the numerator also vanishes. 


The numerator vanishes only if N’(1)=—1. Thus 
N’(1)=—1 unless the surface has a vertical tangent at 
the edge. 


If N’(1)=—1, one evaluates A by L’Hospital’s rule. 
Thus 





2N’+pN” 
K=lim 
el | N+pN’ K 
(1—p?N?)! 


2—N"’(1) 


’ 


] See references 9 and 10. 


and therefore by Eq. (8.2) 
N" (1) =2—K?=2[1—2s’(1) ]. (8.4) 


A surface having a horizontal tangent at the edge will 
have s’(1)=1 and hence N’”’(1)= —2. In fact for such a 
surface 

N(i)=1, 


Ni(i)=—-1, N"(1)=—-2. (8.5) 


9. CONICAL SKIRT TERMINATION 


As seen in Sec. 7, the Rinehart-Luneberg (RL) lens, 
which is the geodesic (or path length) analog of the 
Luneberg variable dielectric lens, has a vertical tangent 
at its edge. This means that the electromagnetic wave 
on leaving the lens would be propagated vertically. To 
obtain horizontal propagation, as required by the 
optics, a toroidal bend may be used, see Fig. 2. Since 
(see Sec. 7) the transition from the mean surface of the 
RL lens to the horizontal plane representing the propa- 
gation in free space should be abrupt, the toroidal bend 
introduces an optical error and the rays do not emerge 
exactly parallel. 

One may minimize the optical effect of the toroidal 
bend by making its radius small and by refocusing; i.e., 
by moving the source up into the lens to compensate in 
part for the converging effect of the bend. On the other 
hand one may, within the limits of the geometrical 
optics that considers the propagation to be along rays 
on the mean surface, achieve perfect optics by replacing 
the toroidal bend by the conical skirt termination 
shown in Fig. 2. In this device the rays on the mean 


Outer surface 
/ Mean surface 


Outer surface 
/. Meon surface 


inner surface 






Inner surface 





Reference tine 


Toroidal bend\> 45° Conical mirror 7 


(a) Toroidal Bend Termination (b) Conical Skirt Termination 


Fic. 2. Generating curves for the toroidal bend and conical skirt 
termination of the Rinehart-Luneberg lens. 
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surfaces are turned abruptly into the horizontal plane 
by a 45° conical mirror formed by the inner conducting 
surface. 

From the point of view of electromagnetic theory the 
conical skirt, if indefinitely extended, forms an image of 
the aperture. This image will obviously have the 
electric and magnetic vector in proper directions for 
propagation in a horizontal plane; moreover, the fields 
cf the aperture and its image will be at right angles so 
that one may expect only a very small amount of 
coupling between them. (This also means that one 
would not expect the termination, in itself, to cause 
serious reflections.) 

Tests on a lens with conical skirt termination indicate 
that this termination works somewhat better than the 
toroidal bend. 


10. GEODESIC ANALOG OF A FLAT GUIDE 
WITH VARIABLE INDEX 


In this section we shall obtain convenient expressions 
for a unit-index guide that is optically equivalent to a 
given flat guide with a variable index of refraction 
n(r). From Eqs. (6.3), since p=8=r and u(s)=1, 


=f n(r)dr (10.1) 

p=rn(r) (10.2) 
ff asio= fi de/r=tnr, (10.3) 
b 1 


By this choice of lower limits, s=0 is made to corres- 
pond to r=4 and s=b to r=1. Solving Eq. (10.3) for r, 


s we 
raexp f ds/p=exp | s’ (p)dp/p. (10.4) 
A. ¢c 


If n(r) is known for the flat guide, Eqs. (10.1) and 
(10.2) express in parametric form the relationship be- 
tween the s and p of the curved guide. One may, of 
course, obtain an explicit expression for s(p) or p(s) by 
eliminating the parameter r from between these equa- 
tions. 

On the other hand, if p(s) or s(p) is given, one obtains 
immediately from Eq. (10.4) the mapping function 
r= f(s) or r=F(p), and thus from Eq. (10.2) 


n(r)=p/r=F'(r)/r=p(f(r))/r. (10.5) 


Example 2.—Determine the geodesic analog of a 
plane with index n(r)=1r—*. By Eqs. (10.1) and (10.2), 
using the lower limit d=1, 


s= f dr (r'-*—1)/(1—) (10.6) 


and 
=—ygl—k. 
p=r'-*. 


(10.7) 
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therefore, 
s=(p—1)/(1—&). (10.8) 


This represents a cone with a slant angle y, the angle an 
element makes with the horizontal plane, given by 


secy=s/(p—1)= (1—k). (10.9) 


In order for Eq. (10.8) to represent a realizable surface 
one must have |secy| 21; therefore, |1—k| <1 or 
O<k<2. 

Example 3.—Find what n(r) is required in a plane to 
produce a complete analog of a spherical surface with 
unit index. 

For a sphere of unit radius 


p=sins, (10.10) 
and therefore by Eq. (10.4) 
r=exp f cscsds=a tan(s/2), (10.11) 
b 


where a=cot(b/2). Thus, s=0, s= 432, and s=7z corres- 
pond respectively to r=0, r=a, r= ©. By Eq. (10.6) 


n(r)=r— sin[ 2 tan—(r/a) ]=2a(a?+r*)". (10.12) 


The arbitrary parameter a can be given any positive 
value one chooses. Thus, there are an infinite number of 
flat surfaces having the same optical geometry. They are, 
however, very simply related. Since from Eq. (10.12) 


an(r)=2[1+(r/a)?}"', (10.13) 


one sees that an(r) is always related in the same way to 
r/a. Thus one can scale down any particular solution by 
dividing by a if at the same time one multiplies the in- 
dex of refraction by a. We shall refer to this transform- 
ation as a transformation of scale. 

All rays starting from some point Q on the unit 
sphere having coordinates (s,@) will pass through the 
image point P at the extremity of the same diameter 
and having coordinates (r—s, +7). Likewise by virtue 
of the mapping in Eq. (10.11) all rays from a point Q’ 
in the plane with coordinates (r,6) will pass through the 
image point P’ having coordinates (a?/r, +7). 

The flat plane with index m(r)=2a/(a’+r’) is 
Masxwell’s fisheye. 


11. SEMIPHYSICAL PRESENTATION OF THE 
EQUIVALENCE BETWEEN GUIDES 


Consider a flat guide with index n(r) and its geodesic 
analog. All points in a ring of the first lying between r 
and r+Ar will be mapped on a ring of the second of 
width As in such a way that all optical distances are 
preserved. This means, in particular, that the optical 
thickness of the rings and their optical circumferences 
are the same. Mathematically, therefore, 


As=n(r)Ar (1.1) 


2xp=2xrn(r), 
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which are seen to be essentially Eq. (10.2) and the differ- 
ential form of Eq. (10.1). Finally Eq. (10.3) may be 
obtained by dividing the first of these equations by the 
second. Optical equivalence is, therefore, achieved by 
preserving the optical thickness and circumference of 
corresponding rings. 

One can thus visualize the mean surface of the 
Rinehart-Luneberg lens as that surface produced by 
taking the rings of the flat Luneberg surface relaxing 
their associated index of refraction to unity and at the 
same time letting them expand so as to keep their 
optical thickness and circumference unchanged. Note 
that the outermost ring, since it has an index of unity, 
will not expand. It will, however, stand on edge since its 
inner and outer edges have, neglecting higher orders of 
the differentials, the same circumference. 

It is clear from these considerations that the optical 
geometry of a guide is fully defined by the functional 
relation between the optical circumference and the 
optical distance from the axis, or some reference circle, 
along the surface. Letting 270 represent the former and 
r the latter, the optical properties of the guide are thus 
fully defined by the function ¢=o(r), and in particular 


o=n(s)p(s), r= f m6. (11.2) 
0 


12. REALIZABILITY OF A UNIT-INDEX GUIDE 


It is of interest to inquire if an equivalent unit-index 
guide, or geodesic analog, exists for every flat guide 
employing a variable index n(r) and such a flat guide for 
every unit-index guide. 

For a guide with unit index of refraction, Eqs. (11.2) 
become 


o=p and r=s, (12.1) 


and hence o(7) becomes simply p(s). From Fig. 3 we 
have dp=ds cosy, and thus |dp/ds| <1. Or in terms of 
a(r) 

—1<do/dr<i. (12.2) 


Consider next a flat surface with an index n(r) that is 
any arbitrary positive function of r. For such a space 


c=rn(r), T= fo (r)dr; (12.3) 
therefore, ; 
do/dr=(n(r)+rn’ (r) ]/n(r) (12.4) 
or 
n' (r)=[do/dr—1 Jn(r)/r. (12.5) 


Since n’(r) can have any value from —* to ~, no 
restriction is imposed on do/dr in such a guide. 

If a flat guide is to have a unit-index guide as an 
analog, do/dr satisfies Eq. (12.2). Therefore, by Eq. 
(12.5), since (r) is positive, 


—2/rgn'(r)/n(r) <0. (12.6) 
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Integrating this inequality between any arbitrary ro and 
r>ro, one obtains the inequality 2 In(ro/r) <In[n(r)/ 
n(ro) ]| <0 or its equivalent 


n(ro)ro?/r? <n(r) <n(r0). (12.7) 


Thus if one is to find a unit-index guide whose mean sur- 
face is a surface of revolution** that is equivalent to a 
flat guide with index of refraction n(r), this m(r) must 
not increase with r or decrease faster than 1/?’. 

The above limitation can be understood in the 
following way. Take the limiting case of m(r) being 
constant at r=ro and consider the infinitesimal ring 
(see Sec. 11) at this distance. Since the index is constant 
throughout this ring, it behaves as it would if it were a 
part of a plane with constant index of refraction, the 
geodesic analog of which is again a plane. The ring thus 
expands uniformly in proportion to the values of (ro). 

If n(r) were to increase with r at r=ro, upon relaxing 
the index to unity the outside edge of the ring would 
need to be expanded proportionally more than the 
inside to preserve the optical distances, this would 
cause the outer edge to become ruffled. The ring could 
not therefore be a part of a surface of revolution with 
unit index. 

The other limiting case if for m(r) to decrease as 1/r? 
at r=ro. The ring at r=ro then behaves as part of a 
plane with index given by c?/r*, c being an appropriate 
constant. Since the analog of such a plane is the same 
plane inverted in the circle r=c, see Example 2 for 
k=2, the relaxation of the index to unity will cause the 
above ring to reverse its inner and outer edges. Any 
faster decrease in the index will reduce the optical 
circumference of outside edge of ring still further relative 
to inside edge. The relaxation of the index will then not 
only reverse inner and outer edges but will cause a 
ruffled, or scalloped, edge to be formed, which again is 
not permissible for a surface of revolution. 


13. NONISOTROPIC ANALOG OF THE 
RINEHART-LUNEBERG LENS 


The functional relationship between s and p does not 
define a unique surface since a reflection of a part of the 


** The statement is not true if one permits the unit-index guide 
to have a more general type of mean surface. 
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Fic. 4. A family of curves having the same functional relationship 
between s and p, namely: s= }(p+sin™'p). 


surface in a horizontal plane intersecting the surface 
does not change this relationship. In Fig. 4 is shown a 
family of curves Co, Ci, C2, etc., all of which have the 
same relation between arc length s and radial distance p, 
namely, 


, 


s=4$(p+sin—'p). (13.1) 


The curve Co is the generating curve of the mean surface 
of the Rinehart-Luneberg lens; its height is lettered /po. 
If one draws a horizontal line through Cp at a height 
ho/2 and reflects the portion of Co above it in this line, 
one obtains C,. Similarly by reflecting the curve C; ina 
line drawn at half its height 4, one obtains C2. The 
process can be continued indefinitely. Each of the curves 
can be used to generate a surface of revolution that can 
theoretically be used as a mean surface of a lens. 

The height of the curve C, is h,=/Mo/2n, which ap- 
proaches zero as ” approaches infinity. The end result 
is a generating curve lying along a straight line but 
whose differential length between a point A at p and a 
point B at p+dp is s’(p)dp. The significance of the mean 
surface in an optical sense for such unit-index guides is 
that it determines the optical distance between points, 
this distance being measured along geodesics. Therefore, 
the surfaces generated by the limiting curve of the 
family is a flat sheet on which the metric is 


di*=s"(p)dp*+ p'd®, 


the same as for the Rinehart-Luneberg lens—that gen- 
erated by Co. One may, however, interpret this metric 
as the optical distance between neighboring points in a 
nonisotropic dielectric slab with index of refraction 


(13.2) 


s’(p), for radial rays 


N(p)= (13.3) 


1, for rays perpendicular to the radius. 


The argument is only used to suggest the nature of the 
relationship. The equivalence of the metric for the 
nonisotropic slab and that for the Rinehart-Luneberg 
lens insures their optical equivalence. 


KUNZ 


Practically, one might construct an approximation to 
such ‘a nonisotropic lens by using obstructions in the 
form of circular ridges as suggested by the curves C,. 
Any means, however, may be used that slows down the 
rays only in the radial direction. One serious difficulty, 
however, is the singularity of s’(p) at p=1, see Eq. 
(13.1). 


14. GEODESICS ON SURFACES OF REVOLUTION 


From Eqs. (3.1) and (4.2) the equation for a geodesic 
on a surface of revolution is given in variational form by 


5 { dL=3 J Cds?-+ pte?! 
- f [s(p)-+p°8"(p)¥dp=0, (14.1) 


where the primes indicate derivatives with respect to p. 
Applying Euler’s equation (from the calculus of varia- 
tions) to the dependent variable @ and its derivative 6’. 


p°-d0/dL=h=constant, (14.2) 
or in terms of d@ and ds 
d0=hds/[_p(p?—h*)*}. (14.3) 
From Fig. 5 one sees that 
sina= pd6/dL, (14.4) 


where a is the angle between the geodesic and the 
meridian ; therefore Eq. (14.2) may be written 


p sina=h. (14.5) 


This is the fundamental equation for the behavior of a 
geodesic on a surface of revolution. For certain purposes, 
however, one may use the alternative form given in Eq. 
(14.3). 

At the point of closest approach of the geodesic to the 
axis a= 71/2; therefore h is equal to the value of p at that 
point. 

If one imagines a particle of unit linear momentum 
moving along the geodesic, p sina is its angular momen- 
tum. In terms of such a particle the fundamental Eq. 
(14.5) is just a statement of the conservation of angular 
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momentum. One can, in fact, show that a mass particle, 
constrained to move on the surface of revolution, but 
otherwise free, will move along geodesics on the surface. 
This follows from the fact that the force acting on the 
particle is normal to the surface and thus furnishes no 
torque to change the angular momentum of the particle. 


15. A GENERAL FAMILY OF LENSES}t 


Consider a surface of revolution S (see Fig. 6) 
generated by a curve C whose arc length s is given by 


(15.1) 


where A and B are constants. Let this surface S be 
attached to a cone K along the unit circle p=1. It will 
now be shown that all the geodesic rays passing on the 
same sidef{ of axis of revolution can be made to be 
parallel in a plane representing the development of the 
cone K if the following conditions are met: 


s=Ap+B sin“, 


(1) The source of the geodesic rays is any point P on 
the unit circle p=1, hereafter referred to as the feed 
circle. 

(2) The slant angle y of the cone is taken to be 


w=sec!2A. (15.2) 


This limits the choice of A, since clearly |A| must be 
greater than 3. 
(3) The rays considered must pass through S. 


One may choose as an aperture any convenient 
straight line in the development of K, and for a proper 
choice of this aperture one can presumably deform the 
cone in the neighborhood of this aperture so that the 
aperture becomes a straight line. This is the same idea 
used in the RCA “R-2R” Lens (see reference 1). The 
guide is terminated at this aperture and from there on 
the electromagnetic wave is propagated in free space. 

If a ray starts from the point P in a direction making 
an angle a; with the meridian through P, then on passing 


axis 








Fic. 6. 


tt Professor Wade Ellis of Oberlin College and Mrs. E. C. Fine, 
who was then at the Air Force Cambridge Research Center, worked 
closely with the author in obtaining the results of this section. 

tt The rays passing to the right and to the left of the axis may 
form two separate beams. 
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Fic. 7. Path of a general geodesic ray when the cone K 
is developed. 


through S it will emerge from S at some point Q where, 
by Eq. (14.5), it will make the same angle a; with meri- 
dian through Q. If one designates by @, the number of 
degrees in the arc PQ of the feed circle measured in the 
direction the ray circles the axis§§ then the shape of the 
surface S will determine the functional relation between 
6, and ay. 
In order for the rays to be parallel in the development 
of the cone K, see Fig. 7, one must have 
O*—0,=a,R, (15.3) 
where R is the radius of circular arc NQ’ and @& is the 
value of 6, for the limiting ray corresponding to a,—0. 
This insures the equality of arc lengths VQ and NQ’, 
thus permitting the joining of the pair of points Q and 
Q’ in forming the cone. 
Since, as shown earlier, / is the distance of the geo- 
desic ray from the axis at its nearest point, by Eq. 


(14.3) 
h s'(o)d 
Wek cf 


ae (15.4) 


Thus using Eq. (15.1) 


h dp 
a= 2il a f — 
1 p*(p?—h*)! 


Bf * } (15.5) 
1 pL (p?—h*) (1—p?) }3 


where by Eq. (14.5) h=sinay. 

The first integral in Eq. (15.5) may be found in most 
standard tables and the second may be easily converted 
into a standard form, or, more directly, it may be 
evaluated by contour integration. Evaluating these 





§§ Only rays that pass to right of axis will be considered in this 
discussion. For these @ and 6; are measured counterclockwise as 
one looks down on S. The direction is reversed for the rays passing 
to the left of the axis. 
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integrals Eq. (15.5) becomes simply 
T 
6,=2A (=-a.)+Br- (A+ B)r—2Aa;. (15.6) 


Comparing this with Eq. (15.3), one sees that for the 
rays to be parallel in the cone K when developed one 
must have 


o*=(A+B)xr (15.7) 
and 


R=secy=2A. (15.8) 


From Fig. 7 the perpendicular distance between a ray 
and the reference ray is R sina. The maximum width of 
the beam composed of rays passing on the same side of 
the axis is therefore R, the outer ray corresponding to 
a= w/2. 


Case I. A+B=1 


For this special case, by Eq. (15.1), the surface S is 
given by the relationship 


S(p)= (1—k)p+k sin“'p, (15.9) 


which defines the generalized Rinehart surfaces, see 
Table 1. Since 


s’(p)= (1—k) +R(1—p*)-4, (15.10) 


one has that s’(0)=1 and s’(1)=< (except for k=0). 
Thus, these surfaces all have a horizontal tangent at the 
axis, and, except for k=0 which corresponds to a plane 
surface, a vertical tangent at p=1. 

For this one-parameter family of lenses, by Eq. 
(15.7), @#* =a and the reference ray leaves diametrically 
opposite the source P. The rays passing to the left and 
right of the axis thus form a single beam. By Eq. (15.8) 
the cone K must have a slope angle given by 


-W=sec—2(1—R). (15.11 


) 


Two important special cases already known are: 


(1) RCA “R-2R” lens corresponding to k=0. By 
Eq. (15.11) the cone K is a 60° cone. 

(2). Rinehart-Luneberg lens corresponding to k=}. 
For this choice of k, y=sec™'1=0° and the cone K 
becomes a flat sheet. 


Case II. A+B>1 


The rays going to one side of the axis come out parallel 
to the reference ray, for which by Eq. (15.8), 


6,=0*=(A+B)r>rz. (15.12) 


The rays on the other side of the axis are parallel to a 
similar reference ray which is rotated through the same 
angle &* in the opposite direction. The angle between the 
reference rays and hence the angle between the beams 
is thus 


2(A+B)x—24=2(A+B-—1)z. (15.13) 


KUNZ 


Thus the beam is split, and one may obtain any desired 
angle between the beams. However, due to an unsymme- 
trical illumination of the aperture for each of the two 
beams, the secondary pattern for each beam will be 
distorted. 

As an example, consider the following special case. 


A=2, B=0. 


By Eq. (15.7) each half-beam will be rotated 2x 
radians with respect to the source and hence will circle 
the axis and come out in the direction of the source. The 
two beams will thus form a single beam. 

For this choice, by Eq. (15.1), S is a 60 degree cone, 
and by Eq. (15.8) the cone K is a cone with a slant 
angle of Y=sec~'4. Thus the device consists of merely a 
cone on a cone. It may easily be verified that this lens 
may be obtained by halving the RCA “R-2R” lens by a 
plane through the axis and then joining up the edges of 
one of these halves. The paths of the geodesic rays as 
traced on the surfaces are clearly not affected by this 
modification and one may verify the above behavior of 
the rays. 


16. GAUSSIAN CURVATURE OF THE OPTICAL SPACE 


We shall generalize our treatment by letting the index 
of refraction be a function of s and 6. Thus the optical 
metric for our surface of revolution will be 


di? = n?(s,0)[ds?+ p*d# }, (16.1) 


where as before, df is the differential optical distance 
and s the arc length along the generating curve. 
For a space in which the metric is given by 





df? = gi; (du!)*+ goo(du*)*, (16.2) 
the Gaussian curvature K is given by" 
Ri212 1 te) 1 0222 
ices |—(—_= 
g 24/ gl au! Vg du! 
09/1 Ogun 
Ou? \y/g Ou? 


where Ri2i2 is a nonzero component of the Riemann 
tensor and 





£11 0 | 
g= = £11822. (16.4) 
| 0 £22 
If we let u'=s and u*=8@, and write 
a=V/gu, b=/gn, ab=/g, (16.5) 


Eq. (16.3) reduces at once to 


if od /10b 0 10a 
at 
abLds \a as 00\b 36 
See I. S. Sokolnikoff, Tensor Analysis (John Wiley and Sons: 
Inc., New York, 1951), p. 169. 
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By a comparison of Eqs. (16.1) and (16.2) 


a=n, b=np; (16.7) 


therefore, 


=i 0 {p On 10 /10n 
tion f'+—(*—)+-—(-—)] 
np Os \n Os p 00\n 06 


—Iif 0 0 1 2 
K=— '+—(— inn )+- — In (16.8) 
prt Os\ as p oF 


or 





In terms of the operator'! !! 


100 12 
V2=- —p—+— —, (16.9) 
pos ds p* 0F 
one may write that 
K=-—---{—+vV’* inn) (16.10) 
n?\ p 
For the special case of n= 1 everywhere 
K=—p"/p, (16.11) 


and for the special case of a flat guide p=s=r, and 
therefore 


[= — nV? Inn. (16.12) 


If, as in the Rinehart-Luneberg lens, one has s(p) 
given rather than p(s), one replaces Eq. (16.11) by 


K=-—s"'/(ps’’). (16.13) 


Another modification of (16.11) is made by multiplying 
numerator and denominator by 2p’; then 


K = —2p'p""/2pp' = —dp"/dp? = — (d/dp*)(1/s’). (16.14) 


Applying Eq. (16.12) to the Luneberg lens for which 
n(r)=(2—r*)!, one finds that K=4/(2—r*)*. Thus 
K=3 atr=0 and K=4 at r=1. On the other hand, for 
the Rinehart analog of the Luneberg lens s’=}3[1 
+ (1—p*)!], so that by Eq. (16.14) K=4/[1+(1—p?)! 
This agrees with the above K for the Luneberg lens if 
one applies the relation 1—r?= (1—p”)! of Eq. (7.11). 


17, TREATMENT OF GENERAL SURFACES BY MEANS 
OF DIFFERENTIAL INVARIANTS 


One may express the relationship between equivalent 
guides by means of the differential invariants of the 
two-dimensional optical spaces associated with the 
guides. 


If the optical metric defined on the mean surface is 
given by 


dg? = gi (du!)?+ goo(du?)?, (17.1) 


where u! and 4? are orthogonal coordinates in the space, 


i!!! Fora flat surface (s=p) this operator becomes the Laplacian. 


the differential invariants we wish to consider are 


1 Of dh 1 Of Oh 
PP ae Bad Bo 


211 Ou! Ou! goo OU? au? 


1 
Ai sittin =A) 17. 3 
J= gu (=) +(> ) ah Om 


and for g=g11g22 


ifa ) ) 
aef=—| — (a2 : += (4 | (17.4) 
J glou'\ gi, OutF Au? \ goo Ou? 


When applied to a plane, A,f represents the square 
of the magnitude of the gradient of f, A:(f,g) represents 
the dot product of the gradient of f and the gradient of 
h, and Asf the Laplacian of f. 

Suppose one considers the equivalence between a 
curved guide with index of one, the optical metric for 
which in a geodesic polar coordinate system is! 


(17.2) 





dj*?=ds’+ D*(s,6)d¢", (17.5) 
and a flat guide whose optical metric is 
dt"? = n?(r,0)(dr’+rdF }. (17.6) 


Optical equivalence requires that the metrics apply to 
the same optical space and hence that the invariants 
of (17.2), (17.3), and (17.4) be equal for coordinates s, ¢, 
and coordinates r, 0. 

Letting u!=s and u?=4, one has by Eq. (17.5) gi1.=1 
and g22= D*; therefore, 


Ofdahk 1 Of dh 
&.(f)=——+-— — —, (17.7) 
ds 0s D*? 06 06 
Ofy\? 1/,0f\? 
Aif= (~) +<() 78) 
Os D*\ db 


and 


sida? =) -(-~)| (17.9) 


On the other hand, letting u'=r and «?=8, one has, 
by Eq. (17.6), giu=n? and go2=n’r’; therefore 


Lpafah 1 afar 
a= ——+—— =} 
r? 6 30 


n*Lor Or 
wl) 2G)} 0 


of 1 of 
r or 4 r 06? 


(17.10) 


nw 





and 


n? 


(17.12) 








12 A geodesic polar coordinate system is defined in various texts 
on differential geometry. See, for example, D. J. Struik, Lectures 
on Classical Differential Geometry (Addison-Wesley Press, Cam- 
bridge, 1950), p. 137. 
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Case I 
n(r,0) is given to find D*(s,¢). 
(1) From Eqs. (17.11) and (17.8) 


. Os 
as (= -) + ~(= ~) == 98°, (17.13) 
y? 
which one solves for s=s(r,@). 
(2) From Eqs. (17.10) and (17.7) 
ds 06 1 Os 0d 
n7A,(s,o)=— —+— — —=0, (17.14) 
Or dr r* 0000 
which one solves for ¢=¢(r,9). 
(3) From Eqs. (17.11) and (17.8) 
1 n* 
—=P= (17.15) 





Ai 0o\" 1 40¢\? 
Qe) 
or r?\ 06 
so that one obtains D* as a function of r and @. 


(4) One uses the solutions of Eqs. (17.13) and (17.14), 
i.e., s=s(r,0) and ¢=¢(r,), to obtain the functions 


r=r(s,o), 0=0(s,¢). 


These are then substituted in Eq. (17.15) to obtain 
D*(s,o). 


(17.16) 


Case II 


D*(s,¢) is given, to find n(r,@). 
From Eqs. (17.9) and (17.12), letting v=Inr, 





ira Ov 1 dv 
Aw=— —(D— )+- =(5 —)|-0 (17.17) 
DLdas\ as D d¢ 
and . 
ifa 00 1 00 
hihines —(D- )+- —(; ~)|-0 (17.18) 
Das Os D as 
From Eqs. (17.7) and (17.10) 
dv 00 1 dv 00 
A, (v,0)=—_ —+—_ — —=0, (17.19) 
Os Os DP’ 06 06 
r 
4 
Ayre 
A) 
Y 
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while from Eqs. (17.8) and (17.11) 


2 Oo ) 
sen(“) +42) -4 arm 


and 
1 
sp=(= =) + ~(— -) = =—, (17.21) 
nr? 
Let A be defined by the equation 
a 1 00 
—=)\—— (17.22) 
ds D ae’ 
then by Eq. (17.19) 
1 dv 06 
——=—)— (17.23) 
D d¢ Os 


Since from Eqs. = 20) and (17.21) Ajw=A,0, one sees 
that by Eqs. (17.22) and (17.23) A=1. 
One may, Peo proceed as follows. 


(1) One obtains a solution /(s,o) of the differential 


equation 
1 a 
|< (v= y+ ~(—=)|-o (17.24) 
Dias D d¢ 


which by Eqs. (17.17) and (17.18) may be used as the 
expression either for »=Inr or for @. 

(2) One uses the equations, see Eqs. (17.22) and 
(17.23), 


ov 106 1 dv 00 


, (17.25) 
ds Ddd Dd Os 
to obtain the other of the quantities v and 8. 
(3) One obtains m? as a function of s and ¢ from either 
Eq. (17.20) or (17.21), i.e., 


1 1 


r= =_, 


r’A 10 r’A 1v 





(17.26) 


(4) Since from the first two steps one has v=2(s,¢) 
and 6=8(s,¢), one may obtain from them the inverse 
functions 


s=s(v,0), ¢=(7,8). (17.27) 


From these we may express n” as a function of v and @, 
or, since v=Inr, as a function of r and @. To make 
n(r,0) physically meaningful we select the positive 
square root of n’. 


18. DISCONTINUOUS INDEX OF REFRACTION 


If the index of refraction changes discontinuously at 
some boundary EF, see Fig. 8, one may form a small 
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rectangle ABCD with AB in region 1, having an index 
m,, and with CD in region 2, having an index np. 
Furthermore, at all regular points on the boundary the 
rectangle may be made small enough so that EF is 
approximately a straight line. One, therefore, may make 
AD and BC negligible in optical length compared to 
AB and CD. Thus two moving points at A and B will be 
separated at first by an optical distance ”,AB but on 
moving negligible distances AD and BC are separated 
by an optical distance of n2CD that differs from 2,A B by 
a finite amount equal approximately to A B(n2.—m,). 

One may easily convince oneself that there can be no 
such abrupt change in optical distances between moving 
points if the index is continuous. This means that all 
guides equivalent to a guide with a discontinuous index 
of refraction » must themselves have a discontinuous 
index. That is, one may not expect to find exact geodesic 
analogs of the usual optical system, since these employ 
dielectric lenses with boundaries across which the index 
of refraction changes abruptly. 
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An estimate is made of the accuracy with which the “spectrum” of a finite sample of Gaussian noise ap- 
proximates the spectrum of the original noise from which the sample was taken. The sample is considered to 


be spliced to form an endless loop, thereby making 
discussed. 


INTRODUCTION 


HE widespread use in statistical theory of the 
notion of power spectra of random processes 
stresses the importance of the experimental task of 
estimating these spectra. There is, however, a funda- 
mental difficulty inherent in such an estimate, namely, 
that in order to measure accurately the power spectrum 
of a given random process it is necessary that there be 
available an infinite sample of this process with which 
to work. It is important that some estimate be made of 
the errors that arise because of the necessarily finite 
samples to which one is limited in practice. Tukey’ has 
given, without proof, a clear-cut method by which 
one may estimate the accuracy of a spectrum derived 
from autocorrelation measurements. Also, Grenander 
and Rosenblatt? have stated four theorems, without 
* This work was supported by the U. S. Navy Bureau of Ord- 
nance under Contract NOrd 7386. 
1J. W. Tukey, “Sampling Theory of Power Spectrum Esti- 
mates,” Symposium on Application of Autocorrelation Analysis 
to Physical Problems (U.S. Office of Naval Research, June, 1949). 


2 U. Grenander and M. Rosenblatt, Proc. Natl. Acad. Sci. U. S. 
38, 519 (1952). 


it a periodic function. The effect of this splice is also 


proof, relevant to the estimate of the power spectrum 
of a discrete parameter random process. 

We shall not enter here the argument as to which is 
better to measure, the autocorrelation function or the 
spectrum. But, rather, let us set for ourselves the task 
of estimating the error in the measured spectrum due 
only to the finiteness of the sample. Since in order to 
define a spectrum we require that the function exist 
for infinite time, we shall imagine that we splice the 
head of the sample to its tail, thereby forming an end- 
less loop. This loop could then be played through a 
spectrum analyzer or a correlator as if it were an in- 
finite sample. We shall then determine the accuracy 
with which the spectrum of the original noise can be 
estimated from that of this sample loop. We shall re- 
strict ourselves to stationary Gaussian processes having 
spectra which are strongly continuous.T 

It should be pointed out that the above splicing 
technique is very often applied in practice. The addi- 

t “Strongly continuous” here means that the magnitude of the 


Say does not change significantly in frequency intervals of 
the order of the reciprocal of the sample length. 
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SPLICE 


Fic. 1. Plot of sinr2xft. Note the discontinuity at the 
splice as the function repeats itself. 


tional question then appears as to the effect of the 
discontinuity at the splice. It must be realized that the 
splice is an essential difficulty, and is not merely a 
peculiarity of an experimental procedure. 


CASE OF SIMPLE SINUSOID 


Before attacking the more difficult problem of random 
noise, let us examine the result of splicing a pure sine 
wave signal in the manner indicated above. We shall 
take, for a sine wave of frequency f/f, the portion con- 
tained in the time interval from 0 to 7, and splice this 
into an endless loop. Figure 1 shows the resulting func- 
tion, which is periodic with period 7, and we shall call 
this function sinr27/ft. Its Fourier series expansion is 


sinr2x ft= 449+ > ml Om cos(Zrmt/T) 
+b, sin(2rmt/T)] (1) 


where . 
Cn = ar f sin (2x ft) cos(2xmt/T)dt 
0 


and 


T 
bn = or f sin(2x ft) sin(2xmt/T)dl. 
0 


Integrating these we find 


4m= (1—cos2xfT)[(2efT— 2am)" 
+(2efT+2em)] (2a) 


bm = sin2x fT (2efT—2em)-'— (2xfT+2xm)"]. (2b) 
Similarly for cosr2x ft, we have 


cosr2x ft= }co+ dL mL cm cos(2rmt/T) 
+d,,sin(2rmt/T)} (3) 
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Fic. 2a. Power spectrum for a 4-cps sine wave chopped after 16 
periods plus 180°, then spliced into an endless loop. 
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where 
Cm= sin2a fT[(2efT—2em)-!+ (2efT+22m)-] (4a) 


dm= — (1—cos2xfT)[ (24 fT— 2am) 
—(2xfT+22m)]. (4b) 


Notice that the spectrum that results is affected not 
only by the gross length of the sample but also to some 
degree by the phase at which the splice is made. In 
fact, as the sample length approaches an integral 
number of periods, i.e., as T—»n/f, we can see from 
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Fic. 2b. The same chopped after 16 periods plus 90°. 


Eqs. (2) and (4) that 


a»—0 for all m 

b,.—0 for m¥n 

6,1 

Cm—0 for m¥n 

Cal 

d»—0 for all m. 
Furthermore, one can see that as 7 for any phase 
at the splice, the frequency separation of the sub- 
stantially nonvanishing lines will approach zero. Figure 
2 shows a plot of $(@n?+0,,.”) for a 4-cps sine wave, with 
two different gross lengths of sample, each with both a 
90° splice and a 180° splice. From these one can see the 
tendency for the spectrum lines to bunch together at 
4 cps as T increases. 

SPECTRUM OF NOISE SAMPLE 


Let us turn now to the problem of finding the spec- 
trum of a random noise sample of length T. We shall 
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denote the noise by J(#), choosing as our sample the 
portion of J(¢) contained between 0 and T. Splicing the 
ends to form an endless loop we produce a periodic 
function which we shall call g(t), where 


g(stnT)=I(s), O<s<T, n=0,1,2,---. (5) 


Thus g(¢) is defined for all t (—-0 <i<o), 

Since g(t) is periodic, its spectrum will contain dis- 
crete power impulses at integral multiples of 1/T. on 
the frequency scale. The power spectrum of the original 
noise, /(¢), which we shall call W(f) will be, in general, 
a continuous function of frequency. In order to facili- 
tate the comparison of the measured spectrum with 
W (f), let us define the function w(f) to be T times the 
power of g(t) at the frequency /, and if f is not a mul- 
tiple of (1/7) then we shall define w(f) by means of 
the power impulse of g(¢) at the nearest integral mul- 
tiple of 1/T that is less than f. (In practice, the 
measured spectrum will have some such smoothed out 
appearance as w(f) because of the finite filter width in 
the spectrum analyzer.) 

Writing g(¢) in its Fourier series form we have 


g(t)=4Aot+ > mL Am cos(2rmt/T) 
+B, sin(2xmt/T)]. (6) 


Hence the power spectrum w(f) can be written as 


w(m/T)=43T (An2+ Br?) ( 


where A,, and B,, are the Fourier coefficients of Eq. (6) 
In order to determine A, and B,, we write J(¢) in its 
Fourier form after the fashion of Rice* (we may do this 
since we are assuming /(¢) to be Gaussian), 


T(t) = > Lax cos(2xfxt)+Bx sin(2zxf;t) ], (8) 


where the a,’s and §;’s are independent Gaussian ran™ 
dom variables with zero mean and variance of W (f;,)A/; 
and f,=kAf. Equation (8) is to be considered in the 
limit as Af—0. 

When the sample of length T is chosen from /(t) 
and spliced to form g(¢), each sinusoidal component of 
(8) is affected in the same manner as the sinusoids of 
Eqs. (1) and (3). Hence, using Eqs. (2), (4), and (8) 
and collecting terms we have for the Fourier coefficients 
of g(t) 


Aa™ > Lax sin2rf,.T+8;(1 — cos2rf,T) 
X[(2ef,T—24m)+ (24 fi. T+20m)-] (9a) 


Bn=>D iL —ax(1 —cos2rf.T)+Bx sin2xf,T ] 
XC (2r fT —24m)— (20 f,.T+22m)]. (9b) 


From (9) we see that A,, and B,, are Gaussian random 
variables with zero mean. This is evident from the 
fact that a, and 6, are Gaussian and A,, and B,, are 
each just a sum of many of these a’s and §’s. Further- 
more, as shown in Appendix A, A,, and B,, are sta- 
tistically independent. Consequently the probability 


3S. O. Rice, Bell System Tech. J. 23, 282 (1944). 
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Fic. 2c. The same chopped after 64 periods plus 180°. 


distribution of w(m/T) can be calculated using Eq. (7). 
This calculation is performed in Appendix B and the 
result is 


(w)d dw | wf/i 1 
oj 7 ee a -—+-— 
eeu T(Gn®,)! 2T \QGm =] 
wfi 1 
XI —(—-— (10) 
2T\ Qn Om 
where 
w=w(m/T), Q@m=(Am)w, and ®n=(Br2)w. 


Next we must find expressions for the average values 
@, and ®» in terms of W(f). To accomplish this we 
square Eq. (9), average, and then pass to the limit 
as Af ~df—0. Using 


(cu2)0= (B2)0= W (fad f 
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we obtain 


Gn = f “w( f) sin’x fT (wfT—xm)- 
7 +(rfT+am)" df (11a) 
a f W(f) sine fT (efT—xm)— 
; — (xfT+xm)“ fads. 


From Appendix C we find that these integrals can be 
reduced to 


(11b) 


Qm= TW (m/T)+bna | 
®m= TW (m/T) bn | 


(12a) 
(12b) 
where 64 and 6,,, are both bounded by 

|8m| <0.025V ,+17Xm 


where Y,, is the maximum value of |W(f)—W(m/T)| 
considered as a function of f, X,, is the maximum value 
of |W(f+m/T)—W(f—m/T)| considered as a func- 
tion of f, and r is a number which depends not only 
upon m, but also on the frequency at which W(/) 
becomes substantially flat. The value of r can be esti- 
mated from Fig. 3. 

It is the magnitude of 6,, that represents the effect 
of the splice upon the spectrum measurement. The 
accuracy of the measurement of the ordinates of the 
spectrum will be poor at those frequencies which corre- 
spond to small values of m (i.e., when 6,, is large) 
simply because the average value of the measurement 
will have little relation to the ordinate of the true 
spectrum W(m/T). For larger values of m, @n and @, 
will each more closely approach T—'W(m/T). When 
this occurs we replace @,, and ®,, by this value in Eq. 
(10) and the probability distribution simplifies to 


p(w)dw=e—”'"dw/W (13) 





Fic. 3. r as a function of y and m from Eq. (C-.5). 
See text and Appendix C. 
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where we have written W for W(m/T). Note that since 
m= f oT, where fo is the frequency at which the ordinate 
is being considered, it can be large if the sample length 
T is large or if we only consider very high frequencies. 
In other words, the low frequency limit in the power 
spectrum measurement is determined by the sample 
length. 

For example, let us say we are dealing with a spec- 
trum that is substantially zero beyond 10 cps, and we 
have at our disposal a 5-minute sample from which we 
wish to determine the spectrum. Let us find the magni- 
tude of 6 at a frequency of 0.05 cps. We have the 
following: 


T= 300 sec 
y= 10/0.05 = 200 
m= (0.05) (300) = 15. 
From Fig. 3 we find that r~0.035. If we further say 


that the ordinate of the spectrum at 0.05 cps is of the 
same order of magnitude as the spectral maximum, we 
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Fic. 4. Probability distributions p(w,) for various n. 


can say that 6~0.025W+0.035W =0.06W, or 
Qis ~ B15 ~0.00333W (0.05) 


with about a six percent error. For frequencies lower 
than this the error will be greater, and for higher fre- 
quencies it will be smaller. 

For small enough 6, Eq. (13) shows that the result 
of a measurement of w(m/T) has a mean and standard 
deviation each equal to W (m/T). Such a large standard 
deviation will not lead to reliable results, consequently 
the measuring procedure must be slightly altered. If 
one could obtain » independent samples of J(t), each 
of length 7, then for each value of m we could average 
all 2 measurements of w(m/T), and we shall call this 
average w,. It can be easily shown that this average 
has a probability distribution given by 


n” Wn\™! dW,a 
(2) emt, 
(n—1)!\W W 





p(w,)dw,= 


Plots of p(w.) for various m are shown in Fig. 4. The 
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mean of each of these distributions is W, and as n gets 
large one can see from Fig. 4 that the distribution nar- 
rows, and eventually approaches a Gaussian with 
standard deviation Wn}. 

In actual practice, instead of obtaining m independent 
samples, one would use only one long sample (as long 
as is practical). He could then obtain the same results 
by averaging adjacent ordinates of the measured dis- 
crete spectrum, provided they are independent of each 
other. If the noise spectrum is substantially flat over 
frequency intervals of a few times the reciprocal of the 
sample length, this will be approximately correct. This 
averaging procedure is approximated in practice by the 
use of a wide filter in the spectrum analyzer. 

Using Eq. (14) we can calculate probability bands 
about W in which w, is likely to fall, and from these 
we can compute confidence regions about the measured 
value w, in which W is likely to lie. For example, in 
Fig. 5, the region bounded by the two outer curves is 
the 90 percent confidence region, if we were to say that 
W is within this band we would be right 90 percent of 
the time. Similarly the two inner curves bound the 
68.3 percent confidence region (this corresponds to plus 
and minus one standard deviation in the case of the 
Gaussian distribution). 


CONCLUSION 


We have thus indicated the accuracy and resolution 
one may obtain in estimating the power spectrum of 
noise using only a finite sample of it. The method dis- 
cussed is that of the analog type as contrasted with 
Tukey’s treatment! of the digital method. Our results 
are in agreement with his when the splice effect is 
negligible. Note that the number of his “degrees of 
freedom” are double our value of » for any particular 
point on the power spectrum curve since the sine and 
cosine parts of each frequency component are inde- 
pendent and form a degree of freedom each. Note 
further that Tukey’s method gives twice as many 
degrees of freedom as we would estimate at each experi- 
mental point on the measured power spectrum, but this 
is quite understandable since in his case two adjacent 
points are not completely independent and in our case 
they are.{ In the limiting case of flat noise the total 
number of degrees of freedom is approximately the same 
according to either method. 
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t His equivalent filter shapes are approximately triangular and 
they overlap quite a bit from point to point, whereas in our pic- 
ture, the filter is assumed to be square in shape and nonover- 
lapping. (J. W. Tukey, private communication.) 


POWER SPECTRA 




















. : SS 

















10 100 1000 


Fic. 5. Confidence bands of W for a measured value w. The 
index shown on the curves is the average percentage of time that 
W will be greater than the ordinate for any given n. 


APPENDIX A. STATISTICAL INDEPENDENCE 
OF A, AND B,, 


The independence of A,, and B,, can easily be shown 
by writing down the joint probability of the ky, term 
in the summations of Eqs. (9a) and (9b). Using the 
fact that a, and ; are statistically independent and 
each Gaussian, one can show that the joint proba- 
bility of 
mk = [Lax sin2xf,T+8,.(1—cos2xf,T) ] 

XC (2afxT— 24m) + (20 f,. T+ 24m) ] 
and 
be [—a,z(1 —cos2rf.T)+B: sin2rf,.T | 

XC (20 fT — 24m) — (24 f.T+24m) | 
is Gaussian with no cross term in the exponent, indicat- 
ing that a». and b,,, are independent. This together 
with the fact that an, and b,,; are independent for 


k¥j (since ax, Bx, aj, and 8; are independent) it follows 
that A,, and B,, are also statistically independent. 


APPENDIX B. THE PROBABILITY DISTRIBUTION 
w(m/T) 


Using the result of Appendix A, we can compute the 
probability distribution of w(m/T). Equation (7) states 


w(m/T)= 2 T (A a+ B,,*), 


and using the fact that A,, and B,, are independent and 
each Gaussian, we can show easily that 


\d _ (ae | w me) 
one rT = =) 


w/fi i 
XI {= —_——) jw, 
2T\a B 
where, for convenience we have omitted the subscripts 


m, and have let w=w(m/T). 
Note that if @=@, we have 





p(w)dw= (@T)“ exp(—w/ @T)dw. 
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APPENDIX C. PROOF OF EQS. (12a) AND (12b) 
We shall evaluate here the integrals of Eqs. (11) 


anf Wi) sin’x fT[ (wf T—am)— 
. + (nfT +m)“ Pdf 
@,.= f W (f) sin’ fT[ (wf T—xm)— 
‘ — (xfT+xm) Fdf. 
Let us write for convenience 
Qm=11+Te+Ts 
Bm=I,—I2+T; 


where 
sin’x fT 


I= | W(f)—————¢ 
i 0 (xfT—xm)? ’ 


sg sin’ fT 
2f WW (f)-—_—____"4f 
0 (xfT—xm) (xfT+7m) 


sin’mfT 
b= f W(f)—————7 f 
0 (xfT +m)? 


If we assign W(—f)=W(f), 73 may be written as 
P sin’r fT 
h= f Wf)—7. 
= (xfT—xm)* 
Hence /, and /; may be combined to give 
sin’ fT 
Ith= f W(f)———df. 
= (xfT—x2m)* 


Furthermore, the integrand in J, can be split using 
partial fractions to give 


“ ‘sin’s fT 
2=— | W(f)————?¢/ 
™m (xfT—xm) 
» sin’x fT 

— | W(f)———/. 
rm» (xfT+7m) 

As before, these two integrals may be combined to give 

1 ¢* sin’a fT 
I,=—]} W(f)————7y. 


arm _.. (xfT —1m) 


Then, changing the variable of integration, we get 


o f/m sin?x fT 
ht+h=] Wt - d 
sa J (5+) (xfT)? J 





—o 


i ¢* sin’r fT 
I,=— w(“ +f) df. 
am _.. afT 


and 





(C.1) 
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TABLE I. Values of Cy=2(Si(2xk+2x)—Si(2rk)]. These are 
the coefficients of W[(m+)/T] in Eq. (C.2). One can obtain C; 
for negative values of k from the relation C_x=C,-_1. 











k Ce 
0 0.4514 
1 0.0236 
2 0.0082 
3 0.0042 
4 0.0025 
5 0.0017 
6 0.0012 
7 0.0009 
& 0.0007 
9 0.0006 
10  0,0005 
11 0.0004 
12 0.0003 
13 0.0003 
14 0.0002 
15 0.0002 
16 0.0002 
17 0.0002 
18 0.0001 
19 0.0001 


20 0.0001 





If we assume W (f) to be substantially constant over 
integrals of length 7—' cps, we may approximate the 
integral 7,+/; by the series 


x m+k (EDIT sin?a fT 
ht+l;= > w( yf df 
kao 7 (xfT)? 


This integral can be immediately written as 











(R+D)IT sin? fT 1 
J —df= —[Si(2rk+ 2x)— Si(2zk) ] 
k/T (xfT)? nT 


where 


Si(x)= f t sintdt.§ 
0 


If we let 
Cy= a [Si(2ek+2r)—Si(2rk) ] 


then we may write 


(C.2) 


1 «x m+k 
h4th=- > cv(=—). 
k=—oo is 
Values of C; are given in Table I. From this table we 
can see that if W(/) is substantially constant in the 
neighborhood (m—4)/T < f<(m+-4)/T then 


I,4-]3= T-(0.975W (m/T)+e' ] 
where 
0<e’ <0.025[W 4) 


§ For tables of this function see “Tables of Sine, Cosine, and 
Exponential Integrals,” Vol. I and IT; also “Tables of Sine and 
Cosine Integrals for Arguments from 10 to 100,” National Bureau 
of Standards, Washington, D. C. 
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or better 
I,+13= TW (m/T)+0.025Y », | 
where 


Let us now estimate an upper limit for 72. One must 
be careful in doing this since the integral 


oo 
f x! sin?xdx 
0 


diverges. Rewriting Eq. (C.1), using the fact that 
sin'xfT — sin’xr(—f)T 
xfT  x(-f)T 





we get 


incor f Aho (2)-w(-2) 


Let us choose a y large enough so that W(f) is sub- 
stantially constant for f>ym/T. Then 


: ( a sin’ fT 
= (rm 
0 afT 


r(+3)-0(-) 


(C.4) 








Let 
Xm= maximum of |W (f-+m/T)—W(f—m/T)|. 
Then 


(l+y)m/T 
I2<Xmn(wm)> f (xfT)— sin’ fTaf. 
0 


But 


(l+y)m/T 
f (xfT)— sin’x fTdf 
= (2nT)-{ In[2e(1-+-y)m]— Ci 2m (1-+-y)m]+0.577} 


where 


Ci(x)= f t— cosidt. 


Hence 


T2SXm(2x?mT)-{In[ 24m (1+) J 
— Ci[2arm(1+-) ]+0.577} 

or 

Ts <1rX,,/T 
where 
r= (2x’m)—{In[ 2rm(1+-7) ] 

+[2m(1+7)}'+0.577} (C.5) 
where we have used the fact that for large enough x 

| Ci(x)| <a-. 


Figure 3 shows r as a function of m and y. 
Thus we may say that 


Qm= TW (m/T) 26m | 
Bn TW (m/T)+6m | 


where |6,| <0.025VYn+rXm, Vm is given by Eq. (C.3), 
Xm is defined by (C.4) and r is to be obtained from 
Fig. 3. 

We might point out here, that although 7 is a rather 
vague quantity and hard to pin down precisely, this is 
not a serious disadvantage since a glance at Fig. 3 will 
show that r is a rather weak function of +. 
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Approximate emissivity calculations for CO, have been carried out, as a function of optical density, at 
300 and at 600°K. The calculations involve the assumption that the rotational lines overlap extensively. This 
condition appears to be satisfied at total pressures above about 1 atmos. Comparison of the values calcu- 
lated from spectroscopic data with the emissivities tabulated by Hottel and his collaborators shows satis- 
factory agreement. The analysis presented in this manuscript emphasizes the fact that it is possible to obtain 
reasonable estimates for the engineering emissivity without performing extensive analytical work, provided 
the physical principles are understood and the needed spectroscopic data are available. 





I. INTRODUCTION 


N a series of recently published papers we have 

attempted to estimate the engineering emissivities 
of diatomic molecules by utilizing basic spectroscopic 
constants. Tractable methods of calculation have been 
developed for (a) complete overlapping between rota- 
tional lines' and for (b) separated rotational lines.’ 
Although accurate theoretical calculations of gas emis- 
sivities from spectroscopic data for polyatomic mole- 
cules involve formidable cofnputationa! difficulties, it is 
to be expected that approximate calculations can be 
carried out with relatively little effort by utilizing ap- 
proximations similar to those introduced for the study 
of diatomic molecules.'* For CO, it has been found that 
the total absorptivity at room temperature is sub- 
stantially independent of pressure at pressures exceeding 
about 1 atmos,* thus indicating extensive overlapping 
between rotational lines. Accordingly, it is not un- 
reasonable to base the present preliminary calculations 
on the assumption that extensive overlapping between 
rotational lines does occur. The results would be ex- 
pected to apply, for example, at room temperature for 
total pressures in excess of 1 atmos and at 3000°K at 
pressures in excess of 11 atmos.’ Since it appears quite 
likely that the actual range of validity of the results 
covers a larger range of total pressures, we shall not 
hesitate to show that calculated emissivities agree 
reasonably well with empirical data obtained‘ at 1 
atmos at a temperature of 600°K as well as at 300°K. 
Perhaps the most important conclusion which can be 
derived from the present analysis is the statement that 
the analytical labor involved in making approximate 
emissivity calculations for a polyatomic molecule with 
overlapping rotational lines is trivial provided the im- 


* Supported by the U. S. Office of Naval Research under Con™ 
tract Nonr-220(03), NR 015 210. 

T Failure to receive Referee’s report and loss of the original 
manuscript caused unusual delay in publication time. 

1 (a) S. S. Penner, J. Appl. Phys. 21, 685 (1950); (b) J. Appl. 
Mech. 18, 53 (1951); (c) S. S. Penner and D. Weber, J. Appl. 
Phys. 22, 1164 (1951). 

*(a) Penner, Ostrander, and Tsien, J. Appl. Phys. 23, 256 
(1952); (b) S. S. Penner, J. Appl. Phys. 23, 825 (1952). 

5’ Holm, Weber, and Penner, J. Appl. Phys. 23, 1283 (1952). 

*W. H. McAdams, Heat Transmission (McGraw-Hill Book 
Company, Inc., New York, 1942), Chapter III by H. C. Hottel. 


portant physical principles are understood and the 
needed spectroscopic data are available. It is only fair 
to note, however, that all of the needed integrated 
intensity measurements have not yet been performed 
for COz in spite of the fact that several papers dealing 
with intensity measurements have been published re- 
cently.*-7 


II. BASIC RELATIONS 


Theoretical considerations of intensities for various 
transitions of the CO, molecule were summarized by 
Dennison a number of years ago.® We shall reproduce 
here the parts of the analysis which are useful for 
making approximate emissivity estimates on COs. 

Frequencies (v) and wave numbers (w) corresponding 
to transitions between fixed energy levels are given by 
the Bohr frequency condition 


v=cw= (AWy’+AW 2’)/h, (1) 


where AWy’ and AW,’ represent, respectively, the 
changes in vibrational and rotational energy corre- 
sponding to the frequency v, # is Planck’s constant, 
and ¢ represents the velocity of light. According to the 
results of Dennison and Adel,’ the rotational (W 2’) 
and vibrational (Wy’)- energies for CO. are given, 
respectively, by the expressions 


W x! = he(P?+j—P)[0.3925 —0.00058 (m1 +3) 
4-0.00045 (1t:+1)—0.00307 (+3) ]-+1.7hc(P—1) (2) 
and 


Wy! = hel 1351.2(rry+4)-+672.2(m2+1)-+2396.4(rty+4) 
~0.3(my+4)?— 1.3(me+1)?—12.5(mg+3)? 
+5.7 (m1 +4) (m2+1)—21.9(m1+3) (ms+4) 

—11.0(m2+1)(ms+3)]. (3) 


5 A. M. Thorndike, J. Chem. Phys. 15, 868 (1947). 

6D. F. Eggers, Jr. and B. L. Crawford, Jr., J. Chem. Phys. 19, 
1554 (1951). 

7 Weber, Holm, and Penner, J. Chem. Phys. 20, 1820 (1952). 

8D. M. Dennison, Revs. Modern Phys. 3, 280 (1931). 

®D. M. Dennison, Revs. Modern Phys. 12, 175 (1940). 

%” Slight corrections to the listed numerical values have beer. 
noted by W. S. Benedict and his collaborators. For the present 
purposes these corrections are negligibly small. Recent work on 
frequencies of lines belonging to CO: is described in the following 
papers: Benedict, Herman, and Silverman, J. Chem. Phys. 19, 
1325 (1951); Taylor, Benedict, and Strong, Progress Report on 
“Infrared Spectra of H,O and CO, at 500°C,” Contract Nonr 
248-01, The Johns Hopkins University, March, 1952. 
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APPROXIMATE EMISSIVITY CALCULATIONS FOR CO, 661 


Here , m2, and m; are the vibrational quantum num- 
bers associated with the fundamental vibration fre- 
quencies v;, v2, and v3, respectively. The quantum 
number / measures the angular rotation, in units of 
h/2m, which is associated with the degenerate (bending) 
v2 vibration. The rotational energy levels are identified 
by the rotational quantum number j. 

The selection rules’ for the transitions mn2'n;—> 
n;'n2'"'ns3’ may be summarized as follows. For the 
perpendicular bands, Anz odd, An; even, AJ= +1. For 
the parallel bands, Anz even, An; odd, Al=0. The rota- 
tional selection rules are Aj= +1, 0. 

The amplitude factors* for given rotational transi- 


tions are! 
i ee 
(4; ‘) = aoe , (4) 
j(G+1) 

















sv f-P Bets Xi" 
(4. ') = : . ed : (4; ') ’ (4a) 
j(2j+1) 2j+1 
j 2 G+!) (jFI+1) 
(4; m) = aap ’ (4b) 
47(j+1) 
jo\? 
(4:) wi (4c) 
j 2 (Gl) (j4l—1) 
(. j-l “) = : . : , lA, (4d) 
47(2j+1) 
2 j—] 
(4. :) aie (4e) 
2(2j7+1) 
ies 2 G+) (G+l+1) 
(. : ‘us) = nb 40, (46) 
47(2j—1) 
-i0\" j+1 
(4 :) ese, (4g) 
2(2j—1) 


In Eqs. (4) to (4g) the convention has been adopted 
that the amplitude factor corresponding to the transi- 


ee , , vv 
tion j, 7’, l’ has been written as (4 I ). 


The integrated absorption for a given transition 
from the lower-energy level mn2'n3; 7 to the upper- 
energy level m,'n2'"'n;' ; 7’ is designated as S=S(n,n2q!n3; 
j-nyn?'"'n;'; 7’) and is given® by the approximate 
relation 


S= (82° yN7/3hcQv'Qr’) 
X {exp—[Wy’ (11,12,03)+ Wr' (GD) kT} 


Xere( Ai ') eL1—exp(—hv/kT)}. (5) 


"The amplitude factors given in Eqs. (4) to (4g) are 4 times 
as large as those listed by Dennison (reference 8) because we are 
using a Fourier series in time of the form cos(2rvt) rather than 
exp(2rivt). 


In Eq. (5), v=v(myne'n3; j->m1'n2!"'n;’' ; 7’) is the Bohr 
frequency corresponding to the indicated change in 
the quantum numbers;{ NV7=total number of mole- 
cules per unit volume per unit pressure ; gj = statistical 
weight of the upper state with g;,=2j7’+1 for //=0 
and gjy~=2(27’+1) for ’ 40; B=B(nyn2!ns—ny'ne'"’n;’) 
is a factor which must be determined empirically and 
corresponds to the matrix element of the electric mo- 
ment in the molecule associated with the indicated 
change in (vibrational) quantum numbers;{ Q2’ 
=> » git expL— We’ (j,])/kT ]=complete rotational 
J 
partition function; Qy’=>> ©} exp—[Wy’ (1,2,n3)/ 


mi m2 ne 
kT ]=complete vibrational partition function. 

The amplitude factors given in Eqs. (4) to (4g) are 
based on the assumption that vibration-rotation inter- 
actions have a negligibly small influence on amplitude 
factors. For this reason, the use of Eq. (5) involves 
somewhat cruder calculations than were employed in 
the calculations on diatomic molecules with nonover- 
lapping rotational lines.” 

For the purposes of approximate radiant-heat trans- 
fer calculations, it is convenient to use the integrated 
absorption coefficient for a given band. Let 


a (nyno'n3—n,'n2'"’n3') 
=X LX S(ns'ng; jn!" ny; 7’). (6) 
7’ 


7 


For the parallel bands A/=0 and 


2 
vu 
) 3 gvn( ai ') = git 
7’ 


according to the Burger and Dorgelo summation rules, 
which may be verified by use of Eqs. (4) to (4g) by 
utilizing the selection rules. Hence we obtain from Eqs. 
(5) and (6) the useful result 


a(nyne'ns—ny'n2'"'n;') = (82°8?v'N 7/3hcOv) g1 
X {exp— [Wy (n1,2,n3,1)/kT ]}[1—exp(—hv/kT) ] 
with g:=1 for /=0 and g;=2 for /#0, (7) 
where we have set 


W y+ (nyn2,n3)+ W pe (7,1) = Wr (m1,M2,03,1) + W e()), 
Qv= 2X exp(— Wv/kT),Qv'Qr’=QvOr 
with Qr= d g;expl—W(j)/kT], 


and g;:=g;Xgi. If the assumption is not made that 
v= v'~v(nyno'n3 ; O—ny'ne'"’n;' ; 0) a slight correction is 
obtained to Eq. (7).2+% 


t Although we shall not indicate explicitly the changes in the 
(vibrational) quantum numbers involved, both »v and 8 do, of 
course, vary from one vibration-rotation band to another. 

2B. L. Crawford, Jr. and H. L. Dinsmore, J. Chem. Phys. 18, 
983, 1682 (1950). 

18 Eggers and Crawford (reference 6) have shown how to relate 
for CO, the matrix coefficients to the coefficients of Taylor series 
expansions of the potential energy and dipole moment about their 
respective equilibrium values. 
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Fic. 1. Positions of centers of vibration-rotation bands, observed at room temperature, between 300 and 2400 cm™'. The designations 
vs (very strong), s (strong), m (medium), w (weak), and vw (very weak) are those of Herzberg (reference 14). Also shown is the intensity 


ratio Ry/R) max for a blackbody at 300°K. 


For the perpendicular bands A/#¥0 and 


2 
5 i l’ 
DX girvl Ay ) =en, 
f BF 
9 
fie . vv\? 

Dd giv Ay it ) =3en, LD girvl Az o ) = Bi 
7’ f 


Hence 


a(nyne'ns—n;'ne' ng’) = (49° B*v’N 7 /3hcQv) gi 
X {exp—[Wy (11,22,03,!)/kT }}(1—exp(—hv/kT) J, 
where g;= 1 for /=0 and g;=2 for /#0, (8) 
and ° 


a(n yne'ns—n,' nn’) = (49°8?v'N 7/ 3hcQ v)gi 
X {exp—[W vy (m1,n2,n3,/)/kT ]}[1—exp(—hv/kT) ] 
where g;= 1 for /=0 and g,;=2 for /¥0. (9) 


In using Eqs. (8) and (9) it should be noted particu- 
larly that the quantities 6? are generally different for 
the transitions involving /—/+-1 and }->/—1. The ratios 
of the 6? can be calculated theoretically to the har- 
monic oscillator approximation for harmonic bands (see 
Appendix I for details). 

In the same manner as for emissivity calculations on 
diatomic molecules with overlapping rotational lines,’ 
we shall find Eq. (5) useful for the determination of 


“effective band widths” whereas Eqs. (7) to (9) lead 
directly to “average absorption coefficients” for vibra- 
tion-rotation bands. For the calculation of S it may be 
convenient to combine Eqs. (4) to (4g), (5), and (7) 
to (9). For example, for the positive branch of the » 
fundamental of CO:, we obtain 


S~[w (0110; 7-+00°0 ; 7—1)/w(01'0 ; 0-000 ; 0) ] 
X (Qe) Xa(01'0-200°0) X (+1) 
XLexp—(Wr/kT)], (Sa) 


where w denotes a wave number. 


Ill. APPROXIMATE EMISSIVITY CALCULATIONS 
AT 300°K 


The positions and approximate intensities of the 
stronger vibration-rotation bands of CO, at 300°K are 
well known.*--"4 A first approximation to the effective 
band width is obtained by utilizing relations such as 
Eq. (5a) and defining the effective band width as the 
wave-number range for which S exceeds 10~* of its 
maximum value.! Utilizing this definition of band width, 
which we shall refine presently, a summary of vibration- 


“4G. Herzberg, Infrared and Raman Spectra of Polyatomic 
Molecules (D. Van Nostrand Company, Inc., New York, 1951), 
Table 56 on p. 274. Herzberg’s designations of band intensities as 
vs (very strong), s (strong), m (medium), w (weak), and vw (very 
weak) has been used in Figs. 1 to 3 to identify the approximate 
strengths of bands at room temperature. 
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Fic. 2. Positions of centers of vibration-rotation bands, observed at room temperature, between 600 and 4800 cm™. Also 
shown is the intensity ratio R,/R) max for a blackbody at 600°K. 


rotation bands for CO, at 300°K has been drawn in 
Figs. 1 to 3. In these diagrams the abscissa is the wave 
number whereas the ordinate represents, in arbitrary 
units which vary from one band to another, the in- 
tensity. Integrated intensities at 300°K are given in 
Table I in so far as they are available.’ For the strongest 
bands (v2 and v3 fundamentals) the contributions from 
the isotopic molecule C¥O,'* are also shown. This 
chemical species constitutes about 1.1 percent of all of 
the CO2 present and makes contributions to the total 
emissivity which are not negligibly small for moderate 
and large optical densities of emitter. Also shown in 
Figs. 1 to 3 are the spectral intensities of radiation 
emitted by a blackbody at 300, 600, and 1500°K, again 
utilizing arbitrary scales along the ordinates, which 
are, however, consistent for any one of the given 
temperatures. 

Reference to Fig. 1 shows that the principal contribu- 
tion to the total radiant heat transfer at room tempera- 
ture must occur from the spectral region around the 
ve fundamental. Thus it is readily shown! that the 
“limiting emissivity” for CO, at 300°K is in the neigh- 
borhood of 0.4 with more than 90 percent of the total 
contributions made by the wave-number region ex- 
tending from 551 to 830 cm™. 

It is physically obvious that the engineering emis- 
sivity E at 300°K is determined by the 01'0—-00°0 
transition in the limit of zero optical density of the 


emitter. As was pointed out in an earlier publication,' 
the emissivity as a function of optical density (partial 
pressure p of emitterXoptical pathlength L) for very 
small values of pZ can be computed quite accurately 
either by utilizing a treatment applicable for nonover- 
lapping lines? or by using average absorption coeffi- 
cients. The emissivity E67 of the band with center at 
667 cm™, as calculated from Eq. (30) of reference 2a, 


TABLE I. Integrated intensities for vibration-rotation 
bands of CO, (reference 7). 








Band center 





(cm~!) Transition a(cm~?-atmos~!) 
5109 00°0— 20°1 0.426 
4984 00°0— 12°! 1.01 
4861 00°0-—04°1 0.272 
3716 00°0— 10°1 42.3 
3609 00°0-02°1 28.5 
2349 00°0—-00°1 2706 
2137 01'0—20°0 
2094 010-1270 0.147 
2077 00°0— 1110 
1933 00°0--03'0 0.083 
1886 01'0--04°0 . 

721 01'0-—10°0 

668 0110-0270 

667 00°0--01'0 171.5 

648 02°0-03'0 

618 0110-0290 

961 10°0—-00°1 0.0219" 
1064 02°0—-00°1 0.0532* 








*® Unpublished data obtained by D. Weber. 
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Fic. 3. Positions of centers of vibration-rotation bands observed at room temperature between 1500 and 12 000 cm™. Also 
shown is the intensity ratio R,/R) max for a blackbody at 1500°K. 


is plotted as a function of pl in Fig. 4 and labeled 
“approximation for nonoverlapping lines.” The corre- 
sponding quantity obtained by using the average 
absorption coefficient? PX = (171 cm atmos~)/(106 
cm~') over the effective band width extending from 600 
to 706 cm™ gives results which are practically identical 
with those obtained for nonoverlapping rotational lines 
as pL is reduced below about 0.02 ft-atmos. As pL is 
increased Eq. (30) of reference (2a) rapidly fails to 
apply and yields excessively large values not only for 
Ese: but also for the total engineering emissivity E. 
Needless to say, for pL <0.02 ft-atmos, the numerical 
value of Ee7 is to be identified with the value of E. 

As ‘the optical density is increased to moderate 
values of pL, it becomes necessary first to include the 
01'0—00°0 transition for C“O,"*, which is assumed to 
represent 1.1 percent of the total CO», and for which 
the integrated intensity is roughly the same as for the 
C”O,'* species. In treatments using average absorption 
coefficients it is, of course, necessary to add absorption 
coefficients rather than emissivities in regions in which 
partial overlapping between effective band widths 
occurs. Finally, the contributions made by the 10°0— 
0110 and 02°0—01'0 bands must be included; both of 
these are designated by Herzberg" as being of “me- 
dium” intensity whereas the 03'0-02°0 and 11'0-02°0 
bands are labeled “weak” and the 11'0-02°0 band is 
said to be “very weak.” 


Strictly speaking, room-temperature emissivity calcu- 
lations for CO2 cannot be carried further because the 
integrated intensities for the weaker bands are not 
available. However, it is easily shown that results in 
fairly good agreement with Hottel’s data‘ are obtained 
if reasonable estimates are made for the strongest of 
the bands which has not yet been included in deter- 
mining E. 

It is known from rough unpublished measurements’ 
that the integrated intensity a7 for the 10°0—-0110 
band is of the order of 1 cm™ atmos~. Emissivity 
calculations treating a7; as a variable parameter be- 
tween 0 and 10 cm~ atmos™ are shown in Fig. 4. The 
curves labeled Q721:=0, Qa721;> :. and Q721> 10 with 667 
= 171 correspond to the calculated values of E for the 
various assumed values of pL. In making these calcula- 
tions the concept of the effective band width was re- 
fined in so far as the effective band width for the v2 
fundamental was set equal to the wave-number range 
for which SpL/0.7 >0.1. This definition of band width 
for strong bands has the important advantage of making 
the band width a weak function of pL, in agreement with 
empirical measurements.'® The contributions to E 
made by the 00°1—10°0 and 00°1—02°0 bands are very 
small but have also been included. To these bands 
Weber assigns the following values at 300°K: age: 
=0.0219 cm~ atmos, a1964= 0.0532 cm~ atmos". 

In addition to the calculated emissivities, the results 
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APPROXIMATE EMISSIVITY CALCULATIONS FOR CO; 665 


of empirical measurements‘ are also shown in Fig. 4. 
For pL <0.06 ft-atmos, the “empirical” data are really 
extrapolated results. Reference to Fig. 4 shows fair 
agreement between calculated and observed values of 
E for the reasonable values age7= 171 and a72:= 1 cm 
atmos’. At very small values of pZ the calculated 
emissivities§ are larger than the extrapolated empirical 
data but must be considered to be more reliable since 
they depend only on the numerical value of ag¢7, which 
is known with fair accuracy.*~? Our ability to calculate 
E at very large values of pl is limited by the lack of 
adequate intensity data for the transitions 02°0-—-01'0, 
11'0—-02°0, and 11'0—-02°0. However, it is clear that 
as pL is increased sufficiently above 3 ft-atmos, E must 
approach about 0.4 and will then increase only very 
slowly as exceedingly large values of pL are obtained. 

The discussion of emissivity calculations on CO: at 
300°K as a function of pL is mostly of academic inter- 
est. However, it is instructive in so far as it exemplifies 
the inherent simplicity of the analysis if adequate 
spectroscopic data are available. 


IV. APPROXIMATE EMISSIVITY CALCULATIONS 
AT 600°K 

It is evident from Fig. 2 that the major contributions 
to radiant heat transfer at 600°K are made by vibra- 
tion-rotation bands in the spectral region between 890 
and 2400 cm. As pl goes to zero it is expected that 
the total emissivity E will become substantially equal 
to the emissivity of the intense v3; fundamental of 
C”O,'* with appropriate corrections for contributions 
from the isotopic species C"O,!*. Although we are 
unable to complete the emissivity calculations at 600°K, 
because the needed spectroscopic data are not available, 
the engineering emissivity is estimated correctly for 
small values of pl. This result emphasizes the fact 
that relatively accurate emissivity calculations for 
polyatomic molecules can be performed without diffi- 
culty as soon as the necessary intensity measurements 
are available. 

For the weaker bands, i.e., for all bands but the »; 
fundamental, we shall define the effective band width 
as the wave-number region for which S exceeds about 
10- of its maximum value. For the v3; fundamental we 
set the effective band width equal to the wave-number 
range for which (SpL/0.7)>0.1.|| The long wave- 

§ The dotted curve in Fig. 2 corresponds to age7=29 and 
721 = 1 cm? atmos“. Although this curve represents a good fit of 
the empirical data, the result cannot be considered to be significant 
since ags7 is known to be much larger than 29 cm™ atmos?. 

|| Although the definitions of the effective band widths are 
somewhat arbitrary, it is easily shown that calculated values of 
E are quite insensitive to the chosen band widths. This result is 
caused by automatic choice of too small absorption coefficients 
for band widths which are too large, and conversely (compare 
reference 1a). Use of the quantity S$pL/0.7 >0.1 assures inclusion 
within the band width of the v; fundamental of all of the rotational 
lines with average emissivities in excess of 0.1. Substantially the 
same numerical values for the effective band widths of the »; 
fundamental are obtained if the lines with SpL/0.7 >10“m are 


included, where m is a number which does not differ from unity 
by_more than a factor of three or four. 


number limit of the v3 fundamental is set equal to 2410 
cm~, i.e., 11 cm~! beyond the band head. This choice 
allows for the many tails of rotational lines which 
“spill” across the band-head limit. We proceed by 
calculating separately the partial emissivities for se- 
lected wave-number regions. 


A. Contributions of the 00°1—10°0 and 
00°1—02°0 Bands 


The criterion (S/Smax) >10~* for the weaker bands 
leads to band widths of the order of 140 cm™ at 600°K. 
The integrated intensity for a given vibration-rotation 
band at the arbitrary temperature T, divided by the 
measured value at 300°K, is obtained by the use of 
Eqs. (7), (8), or (9). From these relations we find for 
the integrated intensity a of the band whose ground 
state is identified by the set of quantum numbers 
N\N2'nz the result 


a(T°K)/a(300°K) = (300/T) 
X {Qv (300°K) expLW y (00°0)/300k]} 
x {Ov(T°K) Xexp[ Wy (00°)/kTJ}—! 
X {exp— [Wy (2 n2'n3) — W y (00°) ] 
x [(kT)—'— (300k)"} [1 — exp (—hv/kT)] 
<[1—exp(—hv/300k)], (10) 


where 


Qy(T) expLW y (00)/RkT J~[1—exp(— 1388hc/kT) P' 
x [1—exp(—667hc/kT) }°? 
<[1—exp(—2349hc/kT)}"'. (11) 


The use of Eqs. (10) and (11) leads to the following 
results for the 00°1-—10°0 and 00°1--02° bands, re- 
spect ively 2 Q961 (600°K )—-~9.29e961 (300°K) =0.204 cm-? 
atmos; 1064(600°K )—~7.22a1064(300°K) = 0.385 cm~ 
atmos~. Hence the total contribution to the emissivity 
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Fic. 4. Engineering emissivity E as a function of pL for CO: 
at 300°K. Integrated intensities a are expressed in cm™ atmos". 
The results based on empirical measurements (see reference 4) 
are shown as circles. 
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TABLE II. Absolute intensities at 300 and at 600°K for several 
vibration-rotation bands in the 5-micron region, based on intensity 
calculations using harmonic oscillator approximations (see Ap- 
pendix I). The value 0.147 cm~ atmos“ for the sum of the bands 
centered at 2137, 2094, and 2077 cm™ served as standard to fix the 
absolute intensity scale. 


a at 600°K 


a at 300°K 

Transition (cm~?-atmos™!) (cm~?-atmos™!) 
00°O— 110 0.12 0.040 
01'0-12% 0.020 0.032 
01:0-—20°0 0.0045 0.0075 
01'0—12°0 0.00048 0.00079 
0270— 13°0 0.0012 0.0099 
03°0— 140 0.000038 0.0020 
0270-210 0.00019 0.0015 
10°0—21'0 0.00029 0.0026 
11'0-+30°0 0.0000053 0.00026 








obtained from the two bands under discussion is 


E61 + E,oes= 0.052[ 1 —exp(— 1.40 10-pL) ] 
+0.017[1—exp(—4.15X10-*pL) | 
+0.054[1—exp(— 2.75 10-*pZ) ], 


where pL is expressed in cm-atmos. 


B. Contributions Made by the Bands between 
1800 and 2210 cm™ 


It is known from the recent work of Taylor, Benedict, 
and Strong'® that an appreciable number of vibration- 
rotation bands, in addition to the bands shown in 
Fig. 1 in this region, are observable at 500°C.{{ Since 
all of these bands involve transitions to excited energy 
levels, their integrated intensities would be expected to 
increase as the temperature is raised. Rough absolute 
intensity estimates for several bands in the 5-micron 
region are given in Table II at 300 and 600°K. The 
relative intensity estimates were obtained by using the 
greatly oversimplified harmonic oscillator approxima- 
mation described in Appendix I. Thus the contribu- 


TABLE III. Emissivities E2349 for C%O2'* and C¥O,"* at 600°K as 
a function of optical density.* 








PL 





L, Le 
a ft-atmos Ezu9(C"02'®) Eaus9(C¥O2!*) Ezsag(total) 

0.1 0.0033 0.018 0.0005 0.019 
0.5 0.0164 0.033 0.0016 0.035 
1.0 0.033 0.037 0.0025 0.040 
5.0 0.164 0.050 0.0070 0.057 
15 0.492 0.051 0.011 0.062 
50 1.64 0.052 0.011 0.063 
100 3.28 0.053 0.012 0.065 
200 6.56 0.055 0.012 0.067 








* The tabulated values of E21s9(C%O2'*) correspond to the values which 
must be added to Exs49(C#O2"*) in order to obtain Exss(total). They do 
not include emissivities in the (black) region of the v3 fundamental of 
C#0;!* which is overlapped by the »3 fundamental of C#0O,!6, 


46 Taylor, Benedict, and Strong, Progress Report on “Infrared 
Spectra of H,O and CO; at 500°C,” Contract Nonr 248-01, The 
Johns Hopkins University, Baltimore, March, 1952. 

] Data of the type presented in reference 15 are valuable aids 
in performing emissivity calculations. For example, the spectro- 
scopic emission records indicate, at least qualitatively, the vibra- 


tion-rotation bands which must be included, at temperatures up 
to 773°K. 


PENNER 


tions of both mechanical and of electrical anharmonici- 
ties to the intensity have been neglected. 

As is evident from the experimental data shown in 
reference 15, a large number of vibration-rotation bands 
remain for which even rough intensity data are not 
available. For this reason no result can be given for the 
contributions made to the engineering emissivity by the 
vibration-rotation bands between 1800 and 2100 cm“. 


C. Contributions from the v; Fundamental 


From the measured integrated intensity’ for the »; 
fundamental at 300°K and by use of Eqs. (10) and (11) 
we find ae349= 1060 cm atmos! at 600°K. Of the 
total integrated intensity per (cm~') X (cm™ atmos of 
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Fic. 5. Engineering emissivity E as a function of pL for COz 
at 600°K, based on the empirical correlations of H. C. Hottel 
and R. B. Egbert. Also shown are the calculated contributions to 
the total emissivity made by the v; fundamental. 


COs) we associate 98.9 percent with C"O,'* and 1.1 
percent with CO,'*. For the intense vs fundamental 
the effective band width corresponds to the wave- 
number range Awe349 for which (SpZ/0.7) >0.1 with 
the band width of the isotopic species determined simi- 
larly. In general, the CO,'* band is black before the 
isotopic band makes appreciable contributions. The 
emissivities at 600°K, E2349, calculated by the use of 
average absorption coefficients P=a/Aw, are sum- 
marized in Table III as a function of pL. Comparison 
of E2349(total) with the empirical data‘ plotted in Fig. 5 
shows that for pL <0.033 ft-atmos, Eo39(total) is 
practically identical with the engineering emissivity E. 
The result is in accord with expectations and shows 
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APPROXIMATE EMISSIVITY CALCULATIONS FOR CO, 667 


clearly that useful emissivity data can be obtained with 
practically no analytical work once the physical prin- 
ciples are understood clearly. 


D. Contribution of the 02°1—00°0 and 
10°1—00°0 Bands 


From the measured integrated intensities’ we find 
a3609= 9.70 cm~* atmos for the 02°1—00°0 band and 
a3716= 14.4 cm™ atmos for the 10°1—00°0 band at 
600°K. The effective band widths extend from 3539 
cm™ to 3679 cm™ and from 3646 cm™ to 3716 cm“, 
respectively. Using average absorption coefficients the 
following results are obtained: 


E3609+ E37216~0.0047[1 — exp(— 6.92 10~*pL) | 
+0.0017[1—exp(— 1.72 10-'pL) ] 
+0.0017[1—exp(—1.03X10-pL)]. (13) 


E. The Total (Engineering) Emissivity 


The total (engineering) emissivity is obtained by 
adding the partial emissivities from selected spectral 
regions. Since estimates of partial emissivities were 
not included for all of the important vibration-rotation 
bands, no general expression for the engineering emis- 
sivity can be given although useful results have been 
obtained for small values of the optical density (com- 
pare Sec. IVC and Fig. 5). 


APPENDIX I. RELATIVE INTENSITY CALCULATIONS 
FOR CO, USING HARMONIC OSCILLATOR 
APPROXIMATIONS** 


It is well known that harmonic bands with nonzero 
matrix components are predicted even to the harmonic 
oscillator approximation. Calculations of this sort are 
useful for rough relative intensity estimates although 
mechanical as well as electrical anharmonicity correc- 
tions are neglected. Representative relative intensity 
estimates have been given, for example, by Benedict'® 
and by Kaplan.'® For the present applications it will 
be convenient to present a general equation for relative 
intensities of harmonic bands involving changes in the 
vibrational quantum number 72. 

The desired result is obtained readily by using the 
normalized wave functions for the isotropic plane 
oscillator and an integral involving associated Laguerre 


** The author is indebted to Dr. W. S. Benedict and to Dr. L. 
D. Kaplan for helpful correspondence concerning intensity esti- 
mates for harmonic bands of COs. 

16 L. D. Kaplan, J. Chem. Phys. 18, 186 (1950). 


TABLE (A-I). Relative intensities for harmonic bands in the 5z 
region based on harmonic oscillator wave functions. 











5(nin240— 
giR(nind0— 5 (nin2!0—m1'n2'"'0) ni'nz'"’0) 
Transition ni'n2'"’0) at 300°K at 600°K 
00°0— 1110 1 1 1 
0110-1270 4 0.164 0.810 
01'0—20°0 0.98 0.0376 0.187 
01'0-12°0 0.18 0.00395 0.0196 
0270-130 6 0.0101 0.248 
0330-1440 8 0.000315 0.0507 
0270-2110 0.98 0.00155 0.0382 
10°0—21'0 1.8* 0.00238 0.0656 
11'0-+30°0 0.98 0.0000438 0.00638 








* Adjusted according to Benedict to concentrate most of the intensity 
of resonating bands in the highest frequency member (reference 15). 


polynomials first obtained by Schrédinger.'” In this 
manner it is easily shown that 


[B(nyn2'n3—nyn2""'n3) ?/[B(m0°ns—n,1'n3) | 
= R(nq!—>ny'"”) =[IR’!/(k+-D) 1] 


<k,R’ p-l pl’ —p-1 2 
1)? » (A 
x () 2 _ Med T )| An 


where k=(1/2)(m2—I), k’=(1/2)(m2'—?’), p= (1/2) 
(1+/’+1), and (%) denotes a binomial coefficient. 

By combining Eqs. (7) to (9) with Eq. (A1) the 
following useful result is obtained: 


a(2ynq'ns—nyn2! "'n3)/a (,0°ns—n1'n3) = 5 (n2'—n2'"’) 
= giR(n2'—n2!"’)v(nne'ns—nyny"’n3)/ 
v(2,0°23—11'n3) ]{exp—LW vy (m1,n2,n3,4) 
—Wy(m,0,n3,0) ]/RkT}{1—expl—Av(n2'— 
ng!”’)/kT]} X (1—exp[—hv(0"12)/kT J}. (A2) 


Numerical values of R(n2'—nz,'"’) calculated from Eq. 
(A1) are identical with Benedict’s estimates,!®* which 
utilized data given by Shaffer.'* 

Following Benedict we shall assume that resonance 
between 2v2 and »; is exact, an approximation which 
will not change the order of magnitude of the calculated 
results although it is not in accord with observed infra- 
red and Raman intensities. By utilizing Eqs. (A1) and 
(A2) relative intensities have been computed for a 
number of bands in the 5yz region at 300 and at 600°K. 
The results are summarized in Table (A-I). 


17 E. Schrédinger, Ann. Physik 80, 483 (1926). 
18 W._H. Shaffer, Revs. Modern Phys. 16, 245 (1944). 
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An equation for the radius of a crystal grown by the Czochralski technique is derived by considering the 
mechanical and thermal equilibrium at the liquid-solid interface. The results agree qualitatively with 


experimentally known functional dependencies. 


N growing a crystal by the Czochralski technique,'”? 
which consists of slowly withdrawing a seed crystal 
from a melt maintained near the melting point of the 
material, the size of the crystal is found to depend on 
the melt temperature and the velocity of crystal growth. 
In this note an equation is derived relating the crystal 
dimensions to the melt temperature, the growth ve- 
locity, the surface tension, and other physical constants. 
Then, from measurements on single crystals of ger- 
manium grown with varying temperatures and growth 
rates, values of the surface tension of germanium are 
calculated using the derived relation, and compared 
with directly measured values. 

First, there is a relation between the true crystal 
growth velocity V, and the velocity of seed crystal 
withdrawal V’, that must be demonstrated. The surface 
of the melt will be lowered as the crystal is grown; this 
velocity of lowering v, adds to the velocity with which 
the seed crystal is withdrawn in determining the 
crystal size. If x represents distance measured vertically 
along the axis of the crucible, this statement is repre- 
sented by the equation 


V (x) =0(x)+V". (1) 


The relative areas of the crucible and of the growing 
crystal determine 2, and if the crucible is of nonuniform 
cross section, v will be a function of the level height in 
the crucible. If Y (x) is the radius of the crucible at a 
vertical height x measured from the bottom of the 
crucible, and A(x) is the area of the liquid-solid inter- 
face, by applying conservation of mass to the system 
the following relation is easily obtained : 


, 


v(x) = —_—————__, (2) 
prmY (x)? 


psA (x) 


where pz and ps are the densities in the liquid and 
solid phases, respectively. This correction term can be 
significant in practice because the crystal size may be 
comparable to that of the crucible it is grown from. 
Next, consider the dynamical relations existing at the 
crystal-melt interface. Assuming the interface is planar, 


1 J. Czochralski, Z. physik. Chem. 92, 219 (1918). 


?H. E. Buckley, Crystal Growth (John Wiley and Sons, Inc., 
New York, 1951), p. 83. 


the seed, acting through the mechanisms of surface 
tension and liquid-solid cohesive forces, will lift the 
melt a small distance “a” above the liquid level. The 
calculation of these forces requires some knowledge of 
the shape of this small element of liquid which can be 
obtained by visual inspection of the region during the 
actual growth of the crystal. For the particular case of 
crystals grown from melts of germanium, an approxi- 
mate shape of the system is represented in Fig. 1, 
wherein typical values for the dimensions are r=1 cm, 
a~0.2 cm, b~0.2 cm. The forces acting upward on this 
element of liquid, which to first order can be considered 
as a right circular cylinder, are the surface tension at 
the periphery and the liquid-solid cohesive force. The 
tangential shape of the surface of the element, where it 
meets the melt in the crucible, causes all the surface 
tension forces at the lower surface to be horizontal so 
that the only forces acting downward on the element 
are its weight and the liquid-liquid cohesive forces. 
The discussion below will proceed with the following 
simplifying assumptions: 1. the liquid-liquid cohesive 
force is negligible; 2. the solid-liquid cohesive force is 
small; 3. the dimension b is small compared to 7 so that 
the weight of the liquid lifted above the level of the melt 
can be approximated by pzrgmr’a. Then, by equating 
the forces upward to those acting downward, the dis- 


6¢ 99 


tance ‘‘a’”’ is found to be for equilibrium 


2y 
a=—-, (3) 
pLrg 


in which y¥ is the surface tension, and g is the accelera- 
tion of gravity. 

Now the relation between V, 7, and melt temperature 
can be found by considering the heat transfer processes 
occurring at the interface. Let W=heat conducted 
away from the interface by the crystal per unit time, * 


AT 
W= eat, (4) 
x 


where Kg is the thermal conductivity of the solid, and 
AT/Ax is the temperature gradient at the interface. 
The heat arriving at the interface is that liberated by 
crystallization and that conducted from the melt. The 
heat loss by radiation from this small region of thickness 
a will be neglected so that the total heat brought to 


668 











oe > © Fe es ot 


yy & fF 


ee HY ea th 


St - ee 


ry AF Fy wr 





4 = ve \¥ 





CROSS SECTION OF 


the interface per second is 


T—Te 





K yrr? 


+Lprrrv, (3) 
a 


wherein Ky, is the thermal conductivity of the liquid 
phase, T is the melt temperature, 7» is the melting 
point of the material, and Z is the heat of fusion. By 
equating (4) and (5), substituting in the value of a 
from (3), and dividing through by 7’, the desired rela- 
tion is obtained, 


K rpg 


AT 
(T-—To)r+Lp.V=Ks—=W'. (6) 
27 Ax 





This relation shows clearly the primary dependence of 
the crystal radius, and hence its cross-sectional area, 
on the various variables. For example, with V and W’ 
constant, and neglecting supercooling, 


1 


r~— 


T—T, 
or, with (7—7T >) and W’’ constant, 
r~W'—Lp.V, 


so that as V increases, either by increasing the seed 
withdrawal rate or decreasing the crucible radius, r de- 
creases. Or, with (T—T >) and r constant the growth 
velocity 


V~K Ss. 


All of the above conclusions qualitatively agree with 
known experimental dependencies.’ The above deriva- 
tion assumes that V is less than the pertinent maximum 
Kristallisationgeschwindigkeit,* otherwise a true crystal 
will not be grown. The effects of supercooling, constitu- 
tional supercooling, and the incubation distance for 
cellular substructure‘ and of preferential crystal growth 
velocities have been neglected completely in the pre- 
ceding derivation. 

An explicit relation for r as a function of crucible 


3G. Tammann, Aristallisieren und Schmelzen (Verlag J. A. 
Barth, Leipzig, 1903). 

* Tiller, Jackson, Rutter, and Chalmers, Acta Metallurgica 1, 
428 (1953). 
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Fic. 1. Schematic rep- 











resentation of the ele- 
ment of liquid lifted by LiguiD | 
the seed crystal. 3 
i 
| 


. Shape can be obtained by substituting Eq. (2) in (6). 


Assuming pr=ps and defining Co= K rpzg/2y, then, 


Co(T—To) ; WwW’ 
{fh tora 


+[W’—Lp.V’j=0, 


the solution of which can be obtained from the cubic 
formula. 

An attempt was made to verify (6) by varying the 
pulling rate from 6 to 2 inches per hour in five zones, 
and the melt temperature over a 5°C interval of a 100 
gram single crystal of n-type germanium (having a 
resistivity of 3.5+0.5 ohm cm over the entire length). 
By measuring the radius of the crystal in each of the 
zones and solving the set of five simultaneous equations 
thus obtained for the unknown coefficients in (6), and 
assuming that K,~0.1 cal/sec cm°C and g=980 cm?/ 
sec, the value obtained for the surface tension at the 
melting point is y~340 dyne/cm. This is to be com- 
pared with the value y=600 dyne/cm recently ob- 
tained® by a more direct method. There are three 
uncertainties involved in the above calculation of y 
from (6), the effect of crucible shape was neglected, the 
value for K, was obtained by a perhaps unwarranted 
extrapolation of the observed® temperature variation 
of Kg, and in the solution of the set of simultaneous 
equations, the differences in crystal radii in the various 
zones rather than the radii themselves enter which 
reduces the accuracy of the results. Possibly, the value 
of K, may be higher than that assumed, which would 
tend to reduce the discrepancy. Further experimental 
work would be highly desirable to provide additional 
quantitative tests of Eq. (6). 





5 P. H. Keck and W. Van Horn, Phys. Rev. 91, 512 (1953). 
6 A. Grieco and H. C. Montgomery, Phys. Rev. 86, 570 (1952). 
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The internal damping of ferromagnetic wires was measured as a function of either a superimposed static 
stress or an alternating magnetic field. Similar to a static magnetic field, a static stress affects the damping 
considerably. For example, the logarithmic decrement of a Fe-40 percent Co wire in torsional vibration is 
reduced by a factor of 33 when a heavy weight is suspended on the wire. The effect of an alternating mag- 
netic field on the damping was studied by varying the frequency of the field. It is found that the damping 
reaches a strong minimum at a field frequency of the order of magnitude of 20 cycles/sec. Magnetic hysteresis 
loss measurements were made which explain this phenomenon in terms of magneto-mechanical hysteresis. 





INTRODUCTION 


T is well known that ferromagnetic materials exhibit 
relatively large internal damping or internal friction. 
This is mainly due to an effect, called magneto-mechan- 
ical hysteresis, which was discovered by Becker and 
Kornetzki 20 years ago.' According to this effect, 
ferromagnetic domains, when under the influence of an 
external mechanical stress, align themselves in a similar 
fashion as they do in a magnetic field. Since the align- 
ment of the domains is partly an irreversible process, 
energy is dissipated in realigning domains during a 
mechanical stress-strain cycle. The area of the magnetic 
hysteresis loop is thereby closely related to the area of 
the mechanical stress-strain hysteresis loop. From these 
considerations it is apparent that the damping of 
mechanical vibrations in ferromagnetic materials de- 
pends, to a large extent, on the magnetic hysteresis 
loop. 

Most of the previous work on magneto-mechanical 
hysteresis has dealt with the effect of a static magnetic 
field on the damping.'~* Evidently, the damping de- 
creases when the material is subjected to a static 
magnetic field. The domains are then fixed in the direc- 
tion of the field and are not able to move under the 
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Fic. 1. Schematic diagram of the apparatus. 


1 R. Becker and M. Kornetzki, Z. Physik 88, 634 (1934). 

2? Max Kornetzki, Wiss. Veréffentl. Siemens Werken 17, 410 
(1938). 

3 Bonfiglioli, Ferro, and Montalenti, Phys. Rev. 86, 959 (1952). 


influence of the vibrational stress. Thus, virtually no 
energy is dissipated, and the damping must be small. 
In the present paper, the effects of a static stress and an 
alternating magnetic field are investigated. The effect of 
a superimposed static stress is of practical importance 
since machine members that exhibit magneto-mech- 
anical damping‘ are often subjected to large centrifugal 
forces. The effect of an alternating magnetic field, on the 
other hand, is more of theoretical interest. While it is 
known that a 60-cycle field reduces the damping more 
than a static magnetic field,® a systematic study of this 
phenomenon has not yet been made. Nor to the author’s 
knowledge has this effect been explained. 


TaBLeE I. Composition and dimension of the test wires. 











Length Diameter 
No. %Fe %Cr %Co (in) (mil) 
1 87 13 ee 12.0 52.5 
2 60 tee 40 11.5 59.5 
3 99.98 tee tee 12.0 51.0 
4 65 tee 35 12.0 59.0 





EFFECT OF STATIC STRESS 


The apparatus used for the experiment is schematic- 
ally shown in Fig. 1. A test wire is clamped between two 
steel vises P. The upper pin vise is rigidly mounted 
while the lower pin vise is free to rotate. W is a weight 
resting on the coupling C. The moment of inertia of the 
system is kept constant when different weights are 
suspended on the wire by varying the weight of the soft 
iron pieces I on the long inertia arm A. Consequently, 
the frequency of vibration does not change during the 
measurements. A free torsion vibration is started by two 
electromagnets located in front of the two iron pieces /, 
below and above the paper plane. The decay of the 
vibration is measured optically by means of a light beam 
reflected from a mirror to a scale or to a moving 
film camera. In order to determine the residual damping, 
the test wires are placed in a solenoid S and can be 
subjected to a magnetic field of about 800 oersteds. The 


4A. W. Cochardt, Trans. Am. Soc. Mech. Engrs. 75, A196 
(1953). 
5 E. R. Parker, Trans. Am. Soc. Metals 28, 661 (1940). 
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brass bar B is immersed in oil so that lateral movements 
of the rod are damped quickly. 

Measurements were made on wires 1, 2, and 3, whose 
compositions and dimensions are listed in Table I. The 
carbon content of the Puron wire (No. 3) was 0.006 
percent. Alloys 1 and 3 were vacuum-melted, forged, 
and then swaged; alloy 2 was melted in air, forged, and 
centerless ground. Before the measurements were made, 
all wires were heat-treated at 900°C for two hours in a 
hydrogen atmosphere and were cooled at a rate of 
120°C/hr. X-ray photographs taken on alloy 2 indicated 
a slight preferred orientation of the surface of the wire. 

The results for wires 1 and 2 are shown in Figs. 2 
and 3 for a frequency of 0.4 cycle per second. The log- 
arithmic decrement 6, which is related to the total 
mechanical energy loss per cycle (AU), and the total 
strain energy U; according to the expression 


(AU), 
= (1) 


LOGARITHMIC DECREMENT 





MAXIMUM TORSION STRESS, PS! 


Fic. 2. Logarithmic decrement vs maximum torsion stress 
measured on an Fe-40 percent Co wire for various superimposed 
static tension stresses. 


is plotted against the maximum torsion stress Tm. It is 
evident from the figures that the static stress has a 
pronounced effect on the damping of the vibration. 
For example, at a maximum vibrational torsion stress of 
2500 psi, the logarithmic decrement is 0.20 for 290 psi 
and 0.006 for 10000 psi static tension stress for the 
Fe-40 percent Co wire (Fig. 2); in other words, the 
damping of the torsion vibration decreases by a factor 
of 33 in this range when a heavy weight is suspended on 
the wire. This is exemplified by Fig. 4 which shows the 
oscillogram for these two cases as recorded by the 
moving film camera. The effect of the static stress is 
similar for the other two wires, but not quite so strong 
(Figs. 3 and 5). 

The appreciable difference in the damping behavior 
of the wires, when weights are suspended on them, is 
explainable in terms of domain wall movement. Under 
the application of the static tension stress, ferromag- 
netic domains become aligned along one axis, and they 
remain aligned if the vibrational stress is small compared 


390 PSI 


3,900 PSI 


7,700 PSI 


LOGARITHMIC DECREMENT 


370 PSI, H= 450 Oe 





MAXIMUM TORSION STRESS, PS! 


Fic. 3. Logarithmic decrement vs maximum torsion stress 
measured on an Fe-13 percent Cr wire for various superimposed 
static tension stresses. 


to the static stress. Hence, essentially no energy is 
dissipated during the vibration, and the damping 
capacity is small. 


EFFECT OF AN ALTERNATING FIELD 


In a second series of measurements the effect of an 
alternating magnetic field on the internal damping was 
studied. The apparatus used for these measurements is 
identical with the one shown in Fig. 1 if the weight W is 
removed. Again, the decay of the free torsion vibration 
was measured. However, this time the two leads of the 
solenoid S were connected to an amplifier of a low-fre- 
quency oscillator. By this means not only the magnitude 
of the magnetic field, but also its frequency could be 
varied. Measurements were made on wire 4 (Table I) 
which was prepared and heat-treated like wire 2. 

Before discussing the results, it must be mentioned 
that it was found more convenient for this series of 
measurements to express the damping capacity by the 
mechanical energy absorbed in the material per cycle 
and unit volume for a given stress. This quantity, which 
is given by the area of the stress-strain hysteresis loop, 
is denoted hereafter by AU. It is directly measurable 
only if the stress does not vary throughout the material. 
Since this condition is not fulfilled for a solid wire 
twisted in torsion, AU must be derived from (AU). 
Both (AU), and AU are considered to be functions of 


01234 SEC 
eaeee 








TORSION STRESS, psi 


01234 SEC 
oe eee 


+o) 4 HKU 
oc TTT 





TORSION STRESS, 9: 


Fic. 4. The decay of a free torsion vibration of a Fe-40 percent 
Cowire. Above: with no weight; below: with a heavy weight sus- 
pended on the wire. 
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Fic. 5. Logarithmic decrement vs maximum torsion stress 
measured on a Puron wire. 


the stress only; they are then related® to each other 
through the expression 


mm dV 
(aUy= favav= f AU—dr, 
0 dr 
or (2) 


1 d(AU),) 
aU =—| (aU) ctr 


, 


for simple torsion if 2V (r/r,,*) is substituted for dV /dr. 
(V is the volume and r the torsion stress.) 

Figure 6 shows the values of AU derived in this 
manner as a function of the torsion stress. The magnetic 
field was kept constant at 6 oe while its frequency was 
varied. It is evident that AU, when plotted versus 
frequency, reaches a minimum for a frequency of the 
order of 20 cycles. Both at larger and at smaller fre- 
quencies the damping is higher. The increase of AU at 
higher frequencies is understandable since, due to the 
skin effect, the alternating magnetic field penetrates 
into the wire only to a certain depth. 

This depth is given’ by 





Tm 


_ (ee)! 
2m (fu)! 


where p is the resistivity, c the light velocity, f the 
frequency, and uw the permeability. Substituting into 
this -expression p=10-° ohm cm and y=10*, it is 
evident that 6 is only of the order of 10-* cm for a 
frequency of 2000 sec; in other words, a large part of 
the wire is not subjected to a magnetic field any more, 
and this causes the increase of the damping at larger 
field frequencies. The increase at low frequencies, on the 
other hand, is more difficult to explain. Already Parker 
has shown that a 60-cycle magnetic field reduces the 
damping more than a static magnetic field of the same 
magnitude.® But he was not able to give an explanation 
for this phenomenon. 

As was pointed out above, AU is related to the area 





6A. W. Cochardt, Trans. Am. Soc. Mech. Engrs. (to be pub- 
lished). 

7R. Becker and W. Déring, Ferromagnelisms (Springer-Verlag, 
Berlin, 1938), p. 230. 
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of the magnetic hysteresis loop designated here by 
AU,,. The experiments showed that AU decreases when 
an alternating field is applied. Evidently, AU, must 
then also become smaller, when measured with a super- 
imposed alternating magnetic field, if the damping of 
Fe-40 percent Co wire is actually caused by mag- 
neto-mechanical hysteresis. To check this assumption 
magnetic measurements, similar to the mechanical 
measurements, were carried out. The apparatus used 
for determining AU, is shown in Fig. 7. It is seen that 
the arrangement basically consists of three circuits, each 
connected to an Epstein frame E. The wire is replaced 
by Epstein strips. These have the composition Fe-35 
percent Co-0.5 percent Cr and received the same heat- 
treatment as the Fe-35 percent Co wire. A specially 
built low-frequency generator S consisting of an electro- 
lyte with two stationary and two rotating electrodes 
furnishes the necessary current with a constant fre- 
quency of 0.4 cycle per sec. AU, is determined from the 
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Fic. 6. Mechanical strain energy loss AU vs torsion stress r for 
various frequencies f of a superimposed alternating field of 6 
oersted. 


reading of the wattmeter W whose current connections 
are in the primary circuit, while its voltage leads are in 
the secondary circuit. In this manner the power loss in 
the primary circuit due to the ohmic resistance of the 
primary coil does not cause any reading of the watt- 
meter; and since eddy current losses do not noticeably 
occur at the applied frequency, the deflection of the 
wattmeter gives only the hysteresis loss AU». B, and 
Bz are parallel resonance circuits which prevent the 
alternating current, generated by the oscillator O and 
amplified by the amplifier P in the tertiary circuit, from 
entering the primary and secondary circuit. Analogously 
B; is a series resonance circuit which supresses feedback 
of the primary circuit into the tertiary circuit. The 
magnitude of the magnetic fields is determined from the 
reading of the ammeters A; and A». Both the watt- 
meter W and the Epstein frame E were described in 
considerable detail elsewhere.® 

Figure 8 gives the results of the measurements, again 
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for a constant maximum alternating field of 6 oersted. 
The energy loss AU,, is plotted versus H,,, which is the 
maximum 0.4-cycle magnetic field strength. The fre- 
quency of the superimposed magnetic field was thereby 
varied between 0 and 2000 cycles. It is seen that the 
curves have the same general shape as those in Fig. 6. 
Again, the superimposed static field reduces the energy 
loss by only a slight amount. The lowest curve is the one 
measured at a field frequency of 20 cycles which is also 
in agreement with the mechanical measurements. That 
AU.» increases at higher frequencies is again the result 
of the skin effect. No attempts were made to correlate 
mechanical and magnetic energy losses since this was 
done several times before" and since the two materials 
did not coincide completely. 

The described experiments explain in a simple way 
the effect of an alternating field on the damping of 
mechanical vibrations: A superimposed alternating 
field makes the magnetic hysteresis loop smaller; this in 
turn reduces the area of the stress-strain hysteresis 
loop, since loops are related to each other. The area of 
the stress-strain loop now determines the amount of 
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damping. Hence, an alternating magnetic field must 
reduce the damping of a mechanical vibration. 


SUMMARY 


Ferromagnetic wires were tested in torsion, and the 
decay of the free vibration was measured as a function 
of vibrational stress, static stress, and superimposed 
alternating magnetic field. The effect of a static stress 
was studied simply by hanging different weights on the 
wire and by measuring the logarithmic decrement of the 
vibration. It was observed that, similar to a static 
magnetic field, a static stress decreases the damping 


( oa Burgwin, Proc. Am. Soc. Testing Materials 41, 779 
1941). 

®M. Kornetzki, Z. Physik 121, 560 (1943). 

1 Max Kornetzki, Ann. Physik 2, 265 (1948). 

" William Fuller Brown, Jr., Phys. Rev. 75, 147 (1949). 


12 xio"5 


























Au, youLe/com? 



































0 10 20 30 40 50 60 
Hm, OERSTED 


Fic. 8. Magnetic hysteresis loss AU, vs 0.4-cycle field H», for 


various frequencies f of a superimposed alternating field of 6 
oersted. 


considerably. For example, the logarithmic decrement of 
a Fe-40 percent Co is reduced by a factor of 33 when a 
heavy weight is suspended on the wire. The result of 
these measurements agrees with considerations of 
domain wall movement. 

In the second part of this work, the effect of an alter- 
nating field was studied. A Fe-35 percent Co wire is 
subjected to an alternating magnetic field whose peak 
value is kept constant at 6 oersted while its frequency is 
varied between 0 and 2000 cycles/sec. When plotted 
versus the frequency of the field, a sharp minimum of the 
damping is found at about 20 cycles/sec. In order to 
explain this finding in terms of magneto-mechanical 
hysteresis, magnetic hysteresis loss measurements are 
made on Epstein strips of a Fe-35 percent Co, 0.5 
percent Cr alloy. Again, the superimposed magnetic 
field is kept constant at 6 oersteds while its frequency is 
varied. As expected, the magnetic hysteresis loss, when 
plotted versus the frequency of the superimposed field, 
reaches a sharp minimum at a frequency of the order of 
20 cycles/sec. Thus, the superimposed alternating mag- 
netic field reduces the area of the stress-strain loop as 
well as the area of the magnetic hysteresis loop, which is 
in complete agreement with the magneto-mechanical 
effect. 
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Electron Bunches of Short Time Duration* 
E. W. Ernst AND H. VONFOERSTER 
University of Illinois, Urbana, Illinois 
(Received February 4, 1954) 


OR many experimental purposes, it is necessary to have a 
means of generating bunches of electrons with a duration of 
about 10~" second. This problem occurred during an investigation 
of the time dispersion of secondary emission. For this particular 
purpose, an electron bunch of this duration, recurring at regular 
intervals, and of a relatively small current density was required. 
The method which was devised to generate these bunches is not 
overly complex and may be of interest to experimenters faced 
with a similar problem. 

The method of producing electron bunches of this short dura- 
tion consists of two main steps. First, a group of electrons is 
periodically selected by a process of chopping; second, this 
chopped group of electrons is then compressed into a bunch of 
the desired duration. A diagram of the method is shown in Fig. 1. 

In the chopping section, a dc electron beam is injected between 
the two wires of a Lecher wire pair,! which may be excited at a 
frequency of 3000 megacycles, causing the beam to be deflected 
across a chopping slit such that’ only during a definite portion of 
the rf period are electrons allowed to pass. To enable selection 
of a particular portion of the rf period, a pair of dc deflection 
plates is provided to direct the electron beam such that the desired 
portion of the trace will coincide with the chopping slit. 

The group of electrons selected in this manner is then passed 
through the compression section where a cavity, resonant at 
the same frequency of 3000 megacycles, velocity modulates the 
group of electrons preselected by the chopper. By suitable adjust- 
ment of the relative phase of the rf energy supplied to the chopping 
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Fic. 1. Schematic diagram of a system for producing 
electron groups of short time duration. 
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Fic. 2. Phase compression diagram for the compression section 
of the electron bunching system. 


deflector and that supplied to the compression cavity, the earlier 
electrons of the group will be decelerated and the later electrons 
will be accelerated so that all the electrons will arrive at the target 
plane nearly simultaneously. The quantitative aspects of this 
process are well understood as first-order bunching theory which 
is extensively treated elsewhere.” 

The theory indicates that, for values of the bunching parameter 
k, slightly greater than unity, two close bunches will be formed. 
Since most of the electrons are to be found at and between these 
twe bunches, in cases when the two bunches are extremely close 
together this group of electrons may, for all practical purposes, 
be considered as a single bunch. Because it is desired to include 
at the target plane only electrons which contribute to this single 
bunch, having a phase width of Ags, it is of interest to determine 
the phase spread Ag» over which these electrons may have been 
distributed originally at the modulation plane. This relationship 
may be theoretically determined by a simultaneous solution of 
three transcendental equations. This solution, carried out by a 
numerical method, is shown in Fig. 2 where the high compression 
of the original phase spread A go is clearly demonstrated. 

In order to analyze the proper functioning of the chopping 
and compression sections, both separately and together, an 
analyzing system was placed at the target plane. This analyzer 
consists of two uhf deflection systems, of the same type as that 
used for the chopping deflector, placed perpendicular to one 
another and excited such that a dc beam of electrons is caused 
to trace a complete circle on a fluorescent screen. The action 
of the analyzer is to transform an electrical phase element at the 
target plane into a geometrical deflection angle. Thus, if a group 
of electrons pass the target plane during a phase interval of, 
say, 17°, a 17° sector on the fluorescent screen will be observed. 

A set of typical traces obtained by operating the two sections, 
both separately and together, is shown in the photographs of 
Fig. 3(a) to (d). The first figure (a) shows the trace produced by 
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(c) 


(d) 


Fic. 3. An analysis of the current at the target plane of the electron bunching system under various conditions of operation: (a) 
without chopping or compression; (b) without chopping but with compression; (c) with chopping but without compression; (d) with 
both chopping and compression. (Original.diameter of circular trace about 6 cm. Circular sweep speed about twice the velocity of light.) 


a dc beam of electrons passing the target plane. Applying a small 
amount of velocity modulation (bunching parameter approxi- 
mately 1.2), results in a concentration of electrons in the bunch 
region and a scarcity of electrons one-half period away, as seen 
in (b). The action of the chopping section alone in preselecting a 
group of electrons is shown in (c). Finally the operation of both 
the chopper and the compressor produces a result which is shown 
in (d). The group of electrons which occupied a phase spread of 
about 60° at the modulating plane has been compressed into a 
bunch with a phase width of approximately 2°. Since the fre- 
quency of modulation used is about 3000 megacycles, a 2° phase 
spread corresponds to a duration of approximately 210~™" 
second. 


c * This study is sponsored by the U. S. Air Force Cambridge Research 
enter. 

1L. R. Bloom and H. M. VonFoerster, Beam Analyzer, Technical Re- 
port No. 5-3, U. S. Office of Naval Research Contract (unpublished). 

2 Karl R. Spangenberg, Vacuum Tubes (McGraw-Hill Book Company, 
Inc., New York, 1948), Chapter 17. 





An Effect Produced by Oblique Impact of a 
Cylinder on a Thin Target 


Wituiram A. ALLEN, Jo—E M. Mapes, AND WESLEY G. WILSON 


Michelson Laboratory, U. S. Naval Ordnance Test Station, 
Inyokern, China Lake, California 


(Received Februrary 8, 1954) 


IGHT circular steel cylinders have been fired at conventional 
ordnance velocities into thin plane lead targets. If the 
striking angle is zero or less than a certain critical angle 2¢, the 
front surface of the cylinder acquires negligible plastic deformation 
as a result of the impact. If the critical angle 2¢ is exceeded, the 
front surface of the cylinder is marked by a series of ridges, the 
spacing of which depends on the thickness of the target. 
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Fic. 1. End view of 0.50 caliber cylindrical steel projectile after it has 
passed through a 0.005-in. lead target at 45 deg striking angle. The actual 
diameter of this cylinder is 0.428 in. 


Figure 1 is a photograph of the end of a 0.50 caliber cylindrical 
steel projectile after it has penetrated a 0.005-in. lead target 
aligned at a 45 deg striking angle. Figure 2 is a photomicrograph’ 
of a cross section of a cylinder that shows the wave structure of 
the ridges. Although the mechanism responsible for production 
of these waves is somewhat obscure, the critical angle 29 is 
believed to be the same critical angle discussed by the Los Alamos 
group’ in a paper that deals with metal plates accelerated together 
by high explosive charges. The Los Alamos group has discussed 
the asymmetric collision of dissimilar solids, but has not yet re- 
ported any experimental data. The specimens shown in Figs. 1 and 
2 correspond to the asymmetric case. 

The experiment was modified to obtain symmetric collision. 
Steel projectiles with conical noses specified by the half-angle 
«/2—86 were fired into steel targets aligned at the striking angle 0. 
Plastic deformation occurs along one of the elements of the cone 
provided that 20>2¢. Negligible plastic deformation occurs if 
20<2¢. The critical angle 2¢ determined by this experiment is in 
excellent agreement with the predicted' value for iron. Two 
preliminary determinations indicate the value 2¢=7.7 deg for a 
projectile velocity of 0.87 mm/ysec. This velocity corresponds to 
the plate velocity »»=0.43 mm/ysec of Fig. 15 in reference 1. 

The experiment discussed is believed to be equivalent to that 
of the Los Alamos group. No theoretical or experimental difficulty 
is expected if the technique is extended to higher velocities and 
to solids other than steel. The experiment is expected to be of 
value in checking and determining equation of state data of 
solids in the megabar pressure regime. As a basis for comparison, 
the compressibility of pure iron has been measured up to a maxi- 





Fic. 2. Photomicrograph of a cross section of a projectile which shows 
the wave structure formed by 45 deg impact on a 0.010-in. lead target. The 
average distance from crest to crest is 0.033 in. 
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mum pressure of only 0.03 megabar.? The pressure produced 
at the critical angle 2¢ by the symmetric collision of steel is 
0.47 megabar, a value calculated from the published! equation of 
state of iron. 


' Walsh, Shreffler, and Willig, J. Appl. Phys. 24, 349 (1953). 
?P. W. Bridgman, Revs. Modern Phys. 18, 1 (1946). 





A New Silicon p-n Junction Photocell for Converting 
Solar Radiation into Electrical Power 
D. M. CuHaptin, C. S. FULLER, AND G. L. PEARSON 


Bell Telepkone Laboratories, Inc., Murray Hill, New Jersey 
(Received January .11, 1954) 


HE direct conversion of solar radiation into electrical power 

by means of a photocell appears more promising as a result 

of recent work on silicon p-n junctions. Because the radiant 

energy is used without first being converted to heat, the theoretical 
efficiency is high. 

Photons of 1.02 electron volts (A=1.2 microns) are able to 
produce electron-hole pairs in silicon. In the presence of a p-n 
barrier, these electron-hole pairs are separated and made to do 
work in an external circuit. All of the light of wavelength shorter 
than 1.2 microns is potentially useful for generating electron-hole 
pairs but the efficiency of energy conversion decreases for short 
wavelengths because the energy above the necessary 1.02 electron 
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Fic. 1. Normalized spectral energy distribution. (A) Silicon photocell 
equi-energy response. (B) Solar energy at earth's surface. (C) Curve A 
times Curve B. 


volts is wasted. Allowing for this loss and assuming a working 
voltage of 0.5 volt, which is near the maximum measured, a 
computation over the entire solar spectrum indicates a limiting 
efficiency of approximately 22 percent for a cell of negligible 
internal losses and for utilization of all possible electron-hole 
pairs. 

Several practical factors lower this figure. The untreated 
silicon surface reflects about half of the incident radiation. Some 
of this can be saved by proper surface treatment. The second 
serious loss is recombination of electron-hole pairs before they 
reach the p-n barrier. Penetration of radiation over most of the 
useful spectrum is extremely shallow so that it becomes necessary 
to place the p-m junction as near to the surface as possible except 
for the third serious loss. This is the J?R loss caused by resistance 
in the surface layer and by contact resistance. Extremely small 
cells minimize the resistance loss and give useful data. For cells of 
several square centimeters, special geometry of contacts will 
minimize resistance losses. 

Present work on silicon -n photocells uses a thin layer of 
p-type silicon formed over an n-type base. The surface layer is 
less than 0.0001 inch thick. Figure 1 shows the spectral response 
for one such cell. Curve A is the measured power output for equal 
intensities of weak radiation as a function of wavelength. Maxi- 
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mum sensitivity is arbitrarily taken as unity. Ideally the maximum 
should appear near 1.2 microns with half this sensitivity at 0.6 
and one-quarter at 0.3 micron. Curve B is the normal distribution 
of full sunlight! with the maximum expressed as unity. Curve C 
is the product of Curve A and Curve B again reduced to unity 
for maximum. This curve shows which part of the sun’s radiation 
is most useful for this particular cell. 

The photocells described here have been made to deliver power 
from the sun into a resistance load at the rate of 60 watts per 
square meter of photocell surface. This is approximately 6 percent 
efficiency and compares with a measured value of 0.5 percent on a 
commercially available photocell. The greatest over-all efficiency 
previously reported for direct conversion of solar radiation into 
electrical power is that of Telkes? using thermoelectric junctions 
and amounts to 1 percent. 

We wish to thank H. B. Briggs who made the spectral measure- 
ments shown in Curve A. 

!W. E. Forsythe, Measurement of Radiant Energy (McGraw-Hill Book 


Company, Inc., New York, 1937). 
2 Maria Telkes, J. Appl. Phys. 18, 1116 (1947). 





Dissected Amplifiers Using Negative Resistance 
W. SHOCKLEY AND W. P. Mason 


Bell Telephone Laboratories, Inc., Murray Hill, New Jersey 
(Received December 28, 1953) 


T is the purpose of this communication to point out a line 
of attack upon the problem of making high-frequency semi- 
conductor amplifiers. This attack may be described as the dissec- 
tion of the amplification mechanism into its two constituents: 
negative resistance and directionality. 

Negative resistance is a common feature of all amplifying 
devices operating from a dc power source. Thus the net ac power 
from the device when operating as an amplifier is positive. Since 
this power is the net flux of the ac part of Poynting’s vector, it 
follews that Poynting’s vector must have a positive divergence 
throughout certain “negative resistance regions.” For a vacuum- 
tube triode amplifier, the grid-plate space is such a region, the 
current being high when the voltage is low and vice versa. Simi- 
larly, the space-charge region of the collector junction of a junction 
transistor amplifier has negative resistance. It should be noted 
that these negative resistances are not characteristic of the regions 
per se but instead of their behavior in an amplifier circuit. 

This method of analysis enables one to evaluate quickly such 
proposals as the making of a junction transistor from isolated 
p-n junctions (impossible because no negative resistances can 
occur) or an amplifier from a nonlinear dielectric and dc power 
source. 

Directionality arises in vacuum tubes and transistors because 
of the separation of input and output circuits. This leads to 
asymmetry in the current voltage matrix and permits the achieve- 
ment of high gain by cascading stages without resultant instability. 

Because the drift velocity of electrons (or holes) in semiconduc- 
tors is smaller than the average transit velocity in vacuum tubes, 
the physical dimensions must be smaller for the same transit 
times. Although the solid nature of transistors gives them an 
inherent structural advantage over vacuum tubes, the smaller 
scale required for them poses serious problems of construction 
for devices in the highest-frequency ranges. 

One way around this difficulty is to make two-terminal negative 
resistance devices in which only one dimension need be small. 
A variety of operating principles are possible for the operation 
of such devices." 

Two-terminal negative resistance devices have the disadvan- 
tage, however, that they do not have directionality ; consequently, 
high gain js associated with narrow margins of stability. 

This limitation may be overcome by adding directionality 
through passive elements? such as Hall effect couplers or gyrators* 
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Fic. 1. Hall effect plate and three resistances. (Power supplies and con- 
nections necessary to power the negative resistances are not shown.) 





in wave guides.‘ Such combinations can simulate conventional 
amplifying devices such as a vacuum-tube triode. In Fig. 1 the 
hexagon represents a germanium plate with a magnetic field 
perpendicular to its plane. It contributes in Eqs. (1) and (2) the 
terms involving the conductance yo and the dimensionless quantity 
a which is an odd function of the Hall angle with |a| <1: 


= (yot yi +y2) Vi-[4(1—a) yo+y2 V2, (1) 
I2= —[4(1+a) yo+¥2]Vi+ (yot+ye+ys) V2. (2) 


The conductances y:, y2, and ys; may correspond to negative 
resistance elements. For example, if yi, ye, and ys have the values 


y1=¥s= — (1+a)¥0/2, (3) 

y2= (a—1)y0/2, (4) 
then Eqs. (1) and (2) reduce to 

1,=0 and J2=ayoV 1. (5) 


This corresponds to an ideal pentode for a>0O and to such a 
pentode with voltage reversing transformer for a<0. Thus the 
circuit can simulate conventional amplifiers and in addition 
produce new types of devices. 

Values of a as high as 0.2 can be obtained in n-type germanium 
at room temperature at fields of 17 500 oersteds. Similar reasoning 
to that discussed here applies also to the four-terminal Hall 
effect plate or balanced gyrator.’ 

A simple test® of this principle was made at low frequencies 
by putting two gas diodes with negative resistance characteristics 
on either side of the balanced gyrator of reference 3. With this 
arrangement a gain of 6 db was obtained in one direction, and a 
loss of 46 db in the other. By more careful balancing of the negative 
resistances with the input and output resistances of the gyrator, 
higher gains are possible without instability. 


1A guide to one method of designing such semiconductor devices is 
furnished by the work of F. B. Llewellyn on transit time vacuum-tube 
diodes: Bell System Tech. J. 13, 59-101 (1934); 15, 575-586 (1936). 
Methods of extending this approach to semiconductors have been indicated 
by W. Schockley, U. S. Patent 2,623,102 (Junction Transistor). Another 
physical effect considered by one of us (W.S.) in unpublished work is the 
negative differential mobility to be expected for holes in very high electric 
fields. Interest in this predicted effect was the provoking cause of the 
research on high field mobilities, see E. J. Ryder and W. ye Phys. 
Rev. 81, 139 (1951) and E. J. Ryder, Phys. Rev. 90, 766-769 (1953). The 
theory of this effect has been independently discovered by H. Krémer, Z. 
Physik 134, 435 (1953), who calls it the ‘ ‘Staueffect” but does not consider 
its utilization as a power source. 

2 Passive elements can have the desired directionality, i.e., lead to unsym- 
metrical poy ” venom matrices, only in the presence of m rary fields. 
For a proof see H Casimir, Revs. Modern Phys. 17, 343-350 (1945). 

* Mason, Hewitt, and Wick, J. Appl. Phys. 24, 166-175 (1953). The fact 
that transmission through a crystal is nonreciprocal in the presence of a 
magnetic field was first pointed out by E. M. McMillan, J. Acoust. Soc. 
Am: 19, 922 (1947). 

C. L. Hogan, Bell System Tech. J. gt. 1-32 (1952). 

} Tats test was made by W. H. Hewi 
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On the Interaction of Two Plane Shocks Facing 
in the Same Direction 


Otto Laporte AND E. B. TURNER 


Randall Laboratory of Physics, University of Michigan, 
Ann Arbor, Michigan 


(Received January 4, 1954) 


HE problem of the coalescence of two shocks has been given 

considerable theoretical attention in the literature. First 
treated by von Neumann! and by Courant and Friedrichs,? this 
case of shock interaction was calculated in considerable detail 
by Bull, Fowell, and Henshaw.’ The time sequence of events is 
as follows. First there must be two shocks separated by a region 
of constant flow; at the moment when the rear shock reaches the 
front shock, as it always must, there is formed a stronger shock of 
coalescence proceeding in the same direction as well as an interface 
which moves with the flow and a centered rarefaction which 
moves backwards. For + larger than 5/3, a case which will not 
be considered further here, the rarefaction may be replaced by a 
weak shock facing in the direction opposite to the amalgamated 
shock. 

As far as we know this phenomenon has up to now escaped 
observation in shock tubes. We should like to report a method 
which makes it possible to create two equally facing shocks 
separated by a constant state. This is accomplished by blocking 
the test section with a comblike stack of plates spanning the width 
of the tube (see Figs. 1 and 2). Evidently the wedgelike tips 
will, when struck by the incident shock, give rise to cylindrical 
reflected shocks which quickly, by Huygens principle and Mach 
interaction, form the first plane shock. In the meantime segments 
of the primary shock travel down into the interstices, are reflected 
at the ends, and on emergence form a second plane shock. The 
strengths of these two similarly facing shocks may evidently be 
varied between wide limits by (a) varying the strength of the 
primary shock and (b) varying the ratio of the thickness of the 
plates to that of the interstices. 

The two interferograms shown below were taken with the 
University of Michigan 2- by 7-inch shock tube, using the 9-inch, 
30°-60° Mach-Zehnder interferometer. The gas used was argon, 
and the shock strength or pressure ratio of the incident shock 
was 12. Seven plates separated by six spacers of j-inch thickness 
were employed. The first interferogram, taken with white light, 
shows two perfectly formed shocks‘ proceeding away from the 
stack of plates. The fringe shifts indicate that they possess 
approximately the same density discontinuity and that they are 





_ Fic, 1. First interferogram showing the two shocks before amalgama- 
tion. The white marks identify the central fringe of the white light fringe 
pattern. 





Fic. 2. Second interferogram taken after interaction, 


separated by a region of constant flow. The second interferogram 
shows at the extreme left the amalgamated shock and its larger 
fringe shift. Somewhat to the right the interface with its minute 
fringe shift may be seen. According to the theory, a rarefaction 
should be present between the interface and the reflecting device, 
but it is not discernible in our flow pictures. The reason for this 
is twofold: First, for the shock strengths used here, the density 
change across this rarefaction is small compared with the density 
change across the interface (which itself is small in comparison 
to the density change across the resulting shock). Secondly, 
inasmuch as plate-glass windows were used on the shock tube, it 
was not intended to observe the phenomenon in its ultimate 
refinement at this point. The density changes inferred from the 
fringe shifts are in accord with the theory witbin our expeiimental 
error. Schlieren photographs, taken of the same process, show the 
interface with extraordinary sharpness and clarity. 

This work was partially supported by the U. S. Office of Naval 
Research under Contract No. N6-onr-232, T. O. 4, with the 
Engineering Research Institute of the University of Michigan. 

1J. von Neumann, Progress Report on the Theory of Shock Waves. 
National Defense Research Committee; Division 8, Office of Scientific 
Research and Development No. 1140, 1943. 

2 R. Courant and K. O. Friedrichs, Interaction of Shock and Rarefaction 
Waves in One-Dimensional Motion. National Defense Research Committee, 
Applied Mathematics Panel Report 38.1R (Applied Mathematics Group— 
New York University No. 1), 1943. 

+ Bull, Fowell, and Henshaw, The Interaction of Two Similarly Facing 
Shock Waves. Institute of Aerophysics, University of Toronto, UTIA 
Report No. 25, 1953. 

4 Since the plate stack did not span the height of the tube entirely, but 
left slightly more than 1 inch at top and bottom, the shocks show a slight 


curvature, especially at their ends and at the larger distances from the 
reflecting plates. 





High-Speed Magnetic Pulsing of Ferrites 
R. Conway LeCraw 
Diamond Ordnance Fuze Laboratories, Washington, D. C. 
(Received December 18, 1953) 


NUMBER of papers have reported Faraday rotations of 

microwaves using ferrites in wave guides.'* Work has 
been done by Bryan* at The Naval Research Laboratory to 
determine the limiting speeds of rotation for a given ferrite in 
connection with high-speed microwave switches. He found that, 
for a particular NiZn ferrite and method of pulsing, the minimum 
switching cycle is 0.8 usec. This limitation was attributed primarily 
to the recovery time of the ferrite. 








pi 
to 
of 











ram 
rger 
lute 
tion 
‘ice, 
this 
sity 
sity 
ison 
dly, 
e, it 
late 


ntal 
the 


aval 
the 


aves. 
tific 
ction 
ttee, 
ip— 
cing 
TIA 


, but 
light 
| the 


s of 
has 
y to 
e in 
hat, 
num 
arily 





LETTERS TO 





Fic. 1. (top). Video output of detecting crystal, max rotation =90°. The 
1.5-db point =one-half the first vertical division. 

Fic. 2. (middle). Current J in the coil, max =12 amp. 

Fic. 3. (bottom). Video output of detecting crystal, max rotation >90°. 
The same external triggering pulse was used for all three figures. The time 
scale =0.025 yusec per division. 


Experiments on magnetic pulsing of ferrites at this laboratory 
have resulted in a switch with a recovery time of 0.012 usec to 
the 1.5-db point and a switching cycle of 0.025 usec between 1.5- 
db points. The insertion loss is less than 1 db, and the attenuation 
in the switch-off state is greater than 35 db. 

A rod of a MgMn ferrite is placed along the axis of a section 
of circular wave guide propagating a TE\, wave at X band. A 
longitudinal magnetic field produces a rotation of a linearly 
polarized wave due to the difference in velocities of the two 
oppositely rotating circularly polarized waves of which the linear 
wave is composed.‘ Rotations of 0° and 90° correspond, respec- 
tively, to maximum and minimum transmission into a section of 
rectangular wave guide terminated with the detecting crystal. A 
piece of resistance card just preceding this section and parallel 
to its long dimension absorbs the microwave energy in the switch- 
off state. 

An eight-turn coil is wound about the circular wave guide 
containing the ferrite. A thin axial slit in the guide wal] prevents 
the wall from acting as a shorted turn to the coil. The current J 
in the coil is produced by a hydrogen thyratron pulser and is 
monitored by a synchroscope across a 3-ohm coaxial-type resistor 
in series with the coi]. This type resistor has negligible inductive 
reactance up to several hundred Mc. The current pulse of Fig. 2 
produces the video output pulse of Fig. 1 when the maximum 
amplitude of J is adjusted to produce 90° rotation. The peak 
magnetizing field is about 18 oersteds. In Fig. 3 a larger amplitude 
of J produces more than 90° rotation. Some ringing is noted in 
the current pulse. This can be eliminated if desired. However 
in these experiments the recovery time of the ferrite switch is of 
principal interest. 

An interesting and somewhat unexpected result is that the 
ferrite magnetization (to the first approximation proportional 
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to rotation) follows the magnetizing current quite closely. No 
appreciable time delay in the magnetization process is evident. 
The time delay of the entire output pulse with respect to the 
current pulse is due to the length of wave guide between the 
switch and the detecting crystal. This was checked by using 
various lengths of wave guide. 

Experiments are in progress to study the response of the mag- 
netization of various ferrites as a function of pulse amplitude and 
length with the latter varying from 0.01 to 0.5 usec. 

sont G. Sakiotis and H. N. Chait, Proc. Inst. Radio Engrs. 41, No. 1 
OL. Hogan, Bell System Tech. J. XXXI, No. 1 (1952). 


3 J. W. Bryan (private communication). 
4R. C. LeCraw, Natl. Bur. Standards Rpt. No. 2313, March, 1953. 





Crossed Electron Beam Technique for Measuring 
Space-Charge Effects in Beams* 


Paut D. CoLEMAN, Murray D. Sirkis, AND IRVING KAUFMAN 


Electrical Engineering Department, University of Illinois, 
Urbana, Illinois 


(Received December 28, 1953) 


PACE-CHARGE problems associated with the production and 
bunching of high current density electron beams occupy the 
attention of many workers in the electronics field. The crossed 
electron beam technique! shows promise as a method of studying 
space-charge effects in beams in greater detail and with more 
additional information than, for example, the iris technique of 
Linder and Hernquist.? 

The method, as illustrated in Fig. 1, is to probe the high current 
density test beam by a low voltage swept probe beam and observe 
the Coulomb scattering on a fluorescent screen. Under the condi- 
tions of our experiment, it can be shown that the semiscattering 
angle 4 is given by 


~ 7 (z) 
. 16€mon¢?’ 


(1) 


where o(z) is the charge per unit length of the test beam and 2p, 
the velocity of the probe beam at the point of crossing. Thus 
the deflection is proportional to the net charge per unit length, 
ie., electron density minus positive ion density, and hence can 
be used to evaluate directly the positive ion density as a function 
of time. 

The rate of formation of ions per cm® in an electron beam is 
given by the expression 


dns _ pP(V)min, (2) 


dt 
where ? is the gas pressure, m, the electron density, 7; the electron 
velocity, and P(V) the number of ion pairs formed by an electron 
per unit length at unit pressure® (a function of voltage V). 
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To Synchroscope 








Fic. 1. Crossed electron beam arrangement. 
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, 
Fic. 2, Effect of pressure on scattered probe 
beam for 5.97-kv electrons. 


Figure 2 shows the effect of pressure on the scattered probe 
beam resulting from a high current density pulsed test beam and 
the actual current and voltage pulses measured on a synchroscope. 
The product m% is readily calculated from the synchroscope 
traces as follows: 
with 2,=5.93X1074/V cm/sec, (3) 


nv : 

171 gA 

where A (0.32 cm?) is the mean cross-sectional area of the test 

beam. The derivative dn2/dt is determined from the initial slope 
m of the scattered trace as follows: 
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Fic. 3. Effect of pressure on the rate of fermation 
of ions in electron beam. 
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where Ay=divisions initial vertical displacement and s = divisions/ 
second. 

A plot of the data taken from photographs, such as those of 
Fig. 2, is given in Fig. 3. A value of P(V) computed from the 
curves is also given for three different voltages. 

The scattered beam traces indicate that the test beam never 
becomes completely neutralized by the positive ions. The final 
steady-state condition will be a function of the geometry of the 
tube and any ion sweeping fields that may be present. 

* Work supported in part by Air Force Cambridge Research Center on 
Contract AF 19(604)—524. 

t P, D. Coleman and M. D. Sirkis, J. Appl. Phys. 21, 952 (1953). 

2 £. G. Linder and K. G. Hernquist, J. Appl. Phys. 21, 1088 (1950). 


3 Engel and Steenbeck, Elektrische Gasentladungen (Berlin, 1932). [P(V) 
for O: extrapolated from given curves. ] 





A Combined Slow-Wave Circuit and Beam 
Focusing Structure 
PuHILip RICE 
Stanford Research Institute, Stanford, California 
(Received January 7, 1954) 


HE “strong focusing effect”! has been shown to apply 


equally well to low-velocity electron beams in the presence 
of space charge.2? Such beams have been focused in periodic 
axially symmetrical magnetic fields with considerable savings in 
weight of the permanent magnetic structure. The field was 
produced by means of a series of ring magnets. 

A familiar slow-wave circuit used in traveling wave oscillators 
and amplifiers consists of a series of interlocking “fingers” pro- 
truding from opposite walls of a wave guide. An axial hole permits 
the electron stream to travel down the device. Such a structure 
can be made to produce a periodic magnetic field which will focus 
the electron stream. 

In order to test the idea, a short section consisting of five 
fingers was built. Figure 1 shows the circuit with the beam tester 
in place. All the fingers except for the first are 1.75 in. long, 
1.5 in. wide, 0.25 in. thick, and 0.25 in. apart. To obtain correct 
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magnetic field starting conditions, the first finger was made 2.5 in. 
long and 2.25 in. wide and a steel cylinder 0.75 in. long was 
attached to it. The hole is 0.5 in. in diameter. A Pierce type 
electron gun which produces 14 ma at 1000 v directs a beam hav- 
ing a minimum diameter of 0.07 in. down a copper cylinder 2.25 
in. long and 0.18 in. in diameter. The magnetic field between the 
fingers is supplied by four iron-cored solenoids along the edges of 
the structure. The field could equally well be supplied by two 
U-shaped permanent magnets running the length of the circuit. 
The entire device is made from mild steel. 

Figure 2 shows collector current as a function of peak magnetic 
field in the first gap. The “band pass” type of transmission is 
evident. Higher field bands were not obtained because of satura- 
tion of the iron. The Brillouin value of field for a 0.10-in. diameter 
beam at 750 v and 7.8 ma is 110 gauss. 

With appropriate rf shielding, this type of magnetic circuit can 
be used to focus helix-type tubes without the limitation on 
permanent magnet length inherent in a structure of ring-shaped 
magnets and pole pieces. 


! Livingston, Courant, and Snyder, Phys. Rev. 88, 1190-1196 (1951). 
2J. R. Pierce, J. Appl. Phys. 24, 1247 (1953). 
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American Physical Society—-Members of the 
Executive Committee of the Division of Fluid 
Dynamics for 1954 


Chairman: Francois N. Frenkiel, Applied Physics Laboratory, 
The Johns Hopkins University. 

Vice-Chairman: Otto Laporte, University of Michigan. 

Secretary-Treasurer: Joseph O. Hirschfelder, University of 
Wisconsin. 

Members: Walker Bleakney, Princeton University. S. Chan- 
drasekhar, Yerkes Observatory, University of Chicago. Bernard 
Lewis, Combustion and Explosive Research, Inc., Pittsburgh, 
Pennsylvania. Hans W. Liepmann, California Institute of 
Technology. 


High-Vacuum Symposium 


The Committee on Vacuum Techniques announces that a 
High-Vacuum Symposium will be held at the Berkeley Carteret 
Hotel at Asbury Park, New Jersey, on June 16, 17, and 18, 1954. 

Papers are being selected for presentation in five major cat- 
egories, which include: 1. Panel on nomenclature and standards; 
2. new equipment and instruments; 3. fundamental developments 
in vacuum technology; 4. methods and techniques; and 5. applica- 
tions and processes. Dr. K. C. D. Hickman, will be Honorary 
Chairman of the Symposium. 

The Committee on Vacuum Techniques has been organized as 
a nonprofit corporation. Its prime purpose is to sponsor the 
high-vacuum symposium. Other aims are to further the dissemina- 
tion of knowledge pertaining to high-vacuum techniques and to 
assist in the establishment of universally acceptable nomenclature 
and standards. 

All individuals and companies interested in attending the 
Symposium, presenting papers, or participating in other activities 
of the Committee on Vacuum Techniques, are urged to write to 
P. O. Box 1282, Boston, Massachusetts. 


International Colloquium on Oxide Cathodes 


Fifty years ago the first paper dealing with oxide coated 
cathodes was published. To celebrate the fiftieth anniversary, 
the Société Francaise des Ingénieurs Techniciens du Vide is 
organizing an International Convention to be held in Paris on 
June 24 and 25, 1954. Every prominent specialist is expected to 
attend this convention. 

Papers presented within the scope of the program may consist of 
original material or re-edited drafts of valuable documents. 
Papers accepted by the Organizing Committee will be presented 
by the authors or a summary will be read by the Secretary at the 
sessions of the convention. It is requested that these papers be 
sent to us, in their definitive form, no later than May 15, 1954. 
The texts will be published in a special issue of the periodical 
Le Vide, the official bulletin of the S.F.I.T.V. in their original 
language with, if possible, a French translation. Each author will 
receive, free of charge, a full copy of the Convention Proceedings, 
with 20 reprints of his own contribution. 

For information, please apply to the Société Francaise des 
Ingénieurs Techniciens du Vide, 44, rue de Rennes, Paris VI°, 
France. 

Summary of Program: I—Basic metal; II—carbonates and 
coating process; I[I—emission theory of oxide coated cathodes; 
IV—definition of the cathode properties; V—special cathodes; 
and VI—stability of emissive properties. 


International Conference on Semiconductors 


The Netherlands Physical Society, with the support of the 
International Union of Pure and Applied Physics and U.N.E. 
S.C.O. will organize an International Conference on Semicon- 
ductors, to be held at Amsterdam from June 29th-July 3rd, 1954. 

The following’scientists (two of them with some reserve) will 
deliver lectures: J. Bardeen, W. H. Brattain, H. B. G. Casimir, 
F. A. Kréger, D. Polder, M. Schén, W. Shockley, R. A. Smith, 
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H. J. Vink, H. Welker on subjects such as bulk recombination; 
surface conductivity; surface trapping; surface recombination; 
intermetallic compounds, the band picture in polar and nonpolar 
semiconductors; photoconductivity in semiconductors like PbS, 
PbTe, PbSe, ZnS, CdS; the application of general physical and 
chemical] laws for the preparation of semiconductors with specific 
properties. 

Discussions will be held in connection with these main lectures 
and there will be opportunity to discuss problems in more detail 
in sectional meetings. In these sectional meetings short communica- 
tions of about 15 minutes can be given. 

Scientists who would like to participate in the conference or 
want to give a scientific contribution should communicate this 
to the Secretary (Dr. H. J. Vink, Floralaan 142, Eindhoven/ 
Holland) as soon as possible. They will receive a detailed program 
in due course. 


Conference on Defects in Crystalline Solids 


The H. H. Wills Physical Laboratory of the University of 
Bristol, in cooperation with the International Union of Pure and 
Applied Physics (particularly its Commission for the Physics of 
the Solid State) and with The Institute of Physics, is organizing a 
conference on Defects in Crystalline Solids to be held from 
July 13-July 17, 1954, in Bristol. While not excluding other 
subjects in the field, the organizers propose to give particular 
attention to defects such as dissolved atoms, vacancies, and F 
centers, to microwave resonance methods of investigating their 
properties, and to the way in which they react with dislocations. 
Thus dislocations will be discussed in their chemical aspects, as 
influencing diffusion and precipitation in the solid state, rather 
than in relation to plastic flow. 

It is hoped that a number of authors from overseas will per- 
sonally present their papers, and with this in mind the conference 
has been arranged to follow immediately after the General 
Assembly of the Internationa] Union of Pure and Applied Physics. 

Board and lodging will be provided in Wills Hall, a student hall 
of residence, on special terms, or at hotels. 

The conference is open to any scientist interested in this field, 
subject to the limitations of seating accommodation. 

Further particulars may be obtained from the Secretary, 
H. H. Wills Physical Laboratory, Royal Fort, Bristol 8, or from 
the Secretary, The Institute of Physics, 47, Belgrave Square, 
London S.W.1. 

Those wishing to attend the conference are asked to apply to 
the former, marking the envelope “1954 Conference,” stating 
whether they wish to be accommodated at Wills Hall or at a hotel 
and for what nights accommodation is required. 


International Union of Crystallography 


As already announced, the Third General Assembly and Inter- 
national Congress of the Union will be held in Paris from July 21 
to July 28, 1954. Full details of the meeting are now available in 
the First Circular, copies of which may be obtained from the 
General Secretary of the Union (R. C. Evans, Crystallographic 
Laboratory, Cavendish Laboratory, Cambridge, England), from 
the Secretary of the Programme Committee (A. J. Rose, Labora- 
toire de Minéralogie, 1 rue Victor Cousin, Paris 5, France) or 
from any of the following regional representatives of the Pro- 


ANNOUNCEMENTS 


gramme Committee: G. Menzer (Universitatsinstitut fiir Kristal- 
lographie und Mineralogie, Luisenstrasse 37/II, Miinchen 2, 
Germany), W. H. Taylor (Crystallographic Laboratory, Caven- 
dish Laboratory, Cambridge, England), J. D. H. Donnay (The 
John Hopkins University, Baltimore, Maryland, U.S.A.). 

At the congress papers will be presented on all aspects of 
crystallographic research; there will also be an exhibition of 
crystallographic apparatus and books. After the congress two 
specialized symposia will be held on “The Location and Function 
of Hydrogen” and “The Mechanism of Phase Transitions in 
Crystals,” and there will be visits to localities of mineralogical 
interest. Offers of papers for the congress and symposia and notice 
of enrollment must reach the Secretary of the Programme Com- 
mittee (preferably on the form accompanying the First Circular) 
by February 15, 1954. All genera] correspondence should also be 
addressed to the Secretary of the -Programme Committee. 

See September issue of Acta Crystallographica for full details. 


Cooperative Graduate Summer Sessions 
in Statistics 


Beginning in 1954, North Carclina State College, the University 
of Florida, Virginia Polytechnic Institute, and the Southern 
Regional Education Board will jointly sponsor cooperative 
Graduate Summer Sessions in Statistics. The first session will be 
conducted by a distinguished faculty at Virginia Polytechnic 
Institute in the summer of 1954 and will run from June 9 through 
July 17. 

Inquiries should be addressed to Boyd Harshbarger, Head, 
Department of Statistics, Virginia Polytechnic Institute, Blacks- 
burg, Virginia. 


Short Courses on High-Temperature Properties of 
Materials and Mechanics of Creep 


A short course on High-Temperature Properties of Materials 
will be held at The Pennsylvania State University from June 
21 to June 25 inclusive. Following this course one, on Mechanics 
of Creep will be conducted between June 28 and July 2. Authorities 
from industrial, governmental], and educational institutions will 
give the lectures for both these courses. The programs are planned 
to give a broad coverage of high-temperature properties of 
materials and mechanics of creep by including physical, metallur- 
gical, and design aspects of these subjects. For further information, 
please write to Dr. Joseph Marin, Department of Engineering 
Mechanics, The Pennsylvania State] University, State_College, 
Pennsylvania. 


Course in Modern Industrial Spectroscopy at 
Boston College 


Boston College has announced a special two weeks intensive 
course in Modern Industrial Spectroscopy at Chestnut Hill, 
Boston, Massachusetts, from July 12 to July 23. The course is 
particularly designed for chemists and physicists from industries 
in the process of installing spectrographic equipment. Information 


on the course can be obtained from Prcefessor James J. Devlin, — 


S.J., Physics Department, Boston College, Chestnut Hill 67, 
Boston, Massachusetts. 
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